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WO 97/49822 PC7YEP97/01218 
GENE CONFERRING DISEASE RESISTANCE IN PLANTS AND USES THEREOF 



The present invention relates to disease resistance in Diants and to identifying and 
oreeoing Disease resistance into plants. More particularly, the present invention relates to 
the identification, isolation and characterization of a gene involved ;n oroao soectrum 
disease resistance in plants. 

Plants are constantly chaiienged Dy a wide variety of oatnogenic organisms 
inducing viruses, bacteria, fungi, ana nematodes. Crop Diants are particularly vuineraole 
because they are usually grown as genetically-uniform monocultures: wnen disease strikes. 

losses can oe severe. 

However, most plants have their own innate mechanisms c? defense against 
pathogenic organisms Natural variation for resistance to olant pathogens r.as Deen 
identifiec cy plant oreeaers and pathologists and bred into many crop plants. These natural 
disease resistance genes often provide nigh levels of resistance to or immunity against 
patnogens. 

In many Diant soecies an initial inoculation by a necrotizing pathogen can *mmumze 
the plant to subsequent infection. Tnis acquirea disease resistance was f.rsi documented in 
1901 ana is thought to play an important role in the preservation c* plants m nature. 
Particular weii cnaractenzed examples of olant immunity a r e the pnencmenon or systemic 
acquirea resistance (SAR) ana induced resistance -n plants such as tccacco A~aaujcs,s 
and cucumoer. In these systems, inoculation by a necrotizing pathogen results in systemic 
protection against subsequent infections oy that cathogen as well as a numDer of otner 
agronomicaliy important bacterial, fungai ana vir 3 ; catnogens 
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System ac qmM res.stance can a,sc be ,r, gg e,ed by chemical ; mmu „, 2a(ion 
compounds, certain chemicals tha, ,„ du ce ,„. , mmun „ y fesponse ^ ^ 
compound, can be „, nature, o rlg ,n. such as sa„c y „c a« ,SA,. or can be s y n,he„c 
chem,ca,s. such as 2 .6-d,ch,o,o,son,co„n,c ac,d (1NA) and ben Z o„.2.3„h,ad,azole-7- 
carbo,h,o,c ac, S-me,h yl ester (BTH). Treatment w„h . pathogen or an immun , 2a „ on 
compound ,n d uces , h . expression „, ., „,„. sets 0 , genes , n , o0acco ^ oesi 
=hara_ spaces. D.tteren, numbers an. , ype s o, g enes can b8 expressed ,n other 
P.an,s. The level o, ,nduc„on tor SAR-related g enes induced b y ,mmuni 2 a,,on compounds 
as h, 3 h as ,0.000-™ over background. ,„ parllcula , SAR cha , ac , eri2ed fcy ^ 
expression of SAB genes. ,nc,ud,ng t he pathoaenes.s-related ,P R > genes. 

The SAR genes are mduced )o „ owing ,„„„„„„ by , pa|hogen ^ ^ ^ 
genes nave a role ,n proving s y s,em,c acgu.red res-stance ,o ,n. plan,. These p,an, 
P-ote.ns are induced in ,a, g e amounts in response ,o ,n,ec„on by various pathogens 
■nciud.ng v.ruses. bacteria an d lungi. PH p, 0(eins wefe )lrs , di5covered ^ ^ ^ 

**" a reaC " n9 -ly ,0 intecon w„h tobacco mosa.c v„us ,TMV, 

< ,983, Neth J Pjanr Pathol 89: 2,5-25,; Van Loon „ 985 , Plant MoL gioN 4: 111-116- and 
-nes e, a,. ( , 992) P^ Ceil 4: .45-56, Such proteins are beiieved ,o be a common 
delens.ve systemic response of plants to mfection by pathogens 

Pa,hogenes,s-re,a,e d prote.ns rnclude b„, are no, ,im„e d to SARe 2a and SAPe 2b 
pro,e,ns. the acd.c and bas.c torms o, tobacco PP,a. PR.,b. and PR, C . PR . r PR 2 pR 
PR*-. PP. N . pr-o. PR . 0 .. PR ., pR . p pR _ Q pR s ^ pR r ^ ^ ^ 

Peroxidases; bas,c cucumber perox.dase; the ch„,nase wH.cn ,s a bas.c counterpart o, PR-P 
or PR-O; the beta- ,.3- g ,ucanase .glucan endo- ,3-be,a- g ,ucos,dase. EC 3.2 ,.39, which ,s 
a bas,c counterpart o, PR-2. PR. N Br PR . 0: ana palnogen . in0uc , b|e 
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cucumber. Such PR proteins are disclosed, for example, in Uknes et al. (1992) The Plant 
Cell 4: 645-656 and the references cited therein. 

SAR or SAR-like genes are expressed in ail plant species exhibiting systemic 
acquired resistance. Expression of such genes can be determined by probing with known 
SAR DNA sequences. Por example, see Lawton et al. (1 992) Proceedings of the Second 
European Federation of Plant Pathology (1983), In: Mechanisms of Defence Responses in 
Plants, B. Fritig and M. Legrand (eds), Kiuwer Academic Publishers, Dordrecht, pp. 410- 
420; Uknes et at. (1992) The Plant Cell 4: 645-656; and Ward et al. (1991) The Plant Cell 3: 
1085-1094. Methods for hybridization and cloning are well known in the art. See, for 
example. Molecular Cloning, A Laboratory Manual , 2nd Edition, Vol. 1-3, Sambrook et al. 
(eds.) Cold Spring Harbor Laboratory Press (1989) and the references cited therein. 

Alternatively such SAR or SAR-like genes can be found by other methods such as 
protein screening, +/- screening, etc. See, for example, Liang and Pardee (1992) Science 
257: 967-971; and St. John and Davis (1979) Cell 16: 443. 

Despite much research and the use of sophisticated and intensive crop-protection 
measures, including genetic transformation of plants, losses due to disease remain in the 
billions of dollars annually. Disease resistance genes have previously been cloned but 
transgenic plants transformed with these genes would typically be resistant only to a subset 
of strains of a particular pathogen species. Despite efforts to done genes involved in SAR, 
a gene controlling broad spectrum disease resistance has not been isolated and 
characterized. 

Several lines of evidence indicate that endogenously produced SA is involved in the 
signal transduction pathway coupling the perception of pathogen infection with the onset of 
SAR. Mutants which retain the ability to accumulate SA in response to pathogen yet have 
lost the anility to induce SAR genes or resistance after application of SA or INA have been 



SUBSTITUTE SHEET (RULE 26) 



W O 97/49322 



PCT7EP97 01218 



described by Delanev pr ai Pm^ m »• » 

' a '" ££2£J^alLAca i _Sc ! _92: 6602-6606 (,995) and ,n 
W094,, 6077 ,he »h„,e * which ,s ™, WfalM „ erein by re(erence 

has now ,e.„ d.scovered ,ha, these mutants contain a mutan, gene, which g ene 
■n » w,d, y pe torm controls SAR gene expression an. SAR „ se , The presen, inversion 
-osn.es ,ha, the mutan, gene con,.™ broad spec,™ disease SU scep, Mi , y ,o mu ,an, 
Cants and renders them non,„ducib,e ,0 pathogens and chemical inducers. 

The presen, ,nven,,on concerns ,ne ,den„„ca„on. ,so,a„on and charac,e„za„on o, 
<he w„d, yP e lMUI) ge „e. a g ene wh.ch al.ows activat.on ,„ a p,a„, „ SAR and SAR g ene 
expression ,n response lo b,olog,cal and chemical inducers. 

A mu,an, gene has been identity ,n Mu,a g en l2 ed A r* bldop s,s p,an,s. These P ,a n ,s 
have been ,ound ,o he detective in ,heir norma, response ,o pa.hogen Section ,n ,ha, , h e y 
- no, express genes associated w„h s y s,em,c acquired resistance (S AR, nor are ,he y 

capable of exhibiting SAR Thesp m,,ta^ . 

9 H. These mutants contam a defective gene which has been 

labelled n,m1 (for noninduc.ble immunity). 

The presen, ,nven„on also concerns ,he use o, ,he Coned NIM1 gene and variants 
•hereo, ,o craa,e „a„sgen,c pian.s ,ha, have broad specum d,sease resistance and ,o ,„e 
transgenic p.ants produced ,hereb y . The ,n y en„on ,ur,her concerns ,he use o, ,he Coned 
N,., gene and vanan.s .hereo, ,n a screen.ng method ,or Iden,,^ compounds capab.e 
o, inducing broad speclrum disease resistance in plants. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the effect of chemical inducers on the induction of PR gene 
expression in wild-type and nim1 plants. 

Figure 2 depicts PR-1 gene expression in pathogen-mfected Ws-0 ana ntml piants 
over the course of 6 days from the initiation of infection. 

Figure 3 shows the levels of SA accumulation in Ws-O and nim1 plants infected with 
P. synngae. 

Figure 4 shows the genetic map of the NIM1 region as determined by AFLP and 
SSLP analysis. 

Figure 5 depicts a physical map of the NIM1 region as determined by YAC clone 
analysis. 

Figure 6 shows a physical map of an extended P1/BAC contig. 

Figure 7 shows a physical map setting forth the positions of P1 and BAC clones with 
respect to the flanking AFLP markers and YACs. 

Figure 8 shows a physical map of a further extended P1/BAC contig containing the 
NIM1 gene. 

Figure 9 shows an integrated genetic and physical fine map of the NIM region 
Figure 10 shows an integrated map of the NIM1 region. 

Figure 1 1 shows an integrated map of the N/M1 region including the new AFLP 
markers. 

Figure 12 is a schematic representation of recombinants D169 and C105. 

Figure 13 is a global map of the chromosomal region centered on NIM1 with 
recombinants indicated, including, BACs, YACs and Cosmids m NIM1 region. 

Figure 14 provides the sequence of the 9 9kb region of clone BAC-04 containing the 
NIM 1 gene. 
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F. 9 ure ,5 shows me nucle.c acd sequence of the N!M1 gene and the am,no acd 
sequence of .he N,M, gene product. ,nc,ud,n 9 changes in ,he various atfeles. 

Figure 16 shows the expression o, M«, ,„ dU ced by INA, BTH. SA and pathogen ,„ 

wiid type and mutant alleles of nimh 

F.gure ,7 shows the expression of PR-, ,n „, m , r„ utan ,s and witd-type ptents . 
Figure ,e shows disease resislance in various n,ml mutants. 

Figure ,9 is an am,„o acd sequence companson o, Expressed Sequence Tag 
regions o, the prote.n and cONA prote.n products o, 4 r.ce gene sequences (see S E o 

ID NO: 3). 



DEFINITIONS 

AA - Amino Acid 

AFLP: Ampl.fied Fragment Length Polymorphism 

avrRpt* aviruience gene R P t2. isolated from Pseudomonas synngae 

BAC: Bacterial Artificial Chromosome 

BTH. b enzo( 1 ,2,3),hi a dia2ole-7-c a rboth IO ic acd S-methy, ester 

Col: Arabidopsis ecotype Columbia 

ECs: Enzyme combinations 

INA. 2,6-dichloroisonicotinic acid 

Ler. Arabidopsis ecotype Landsberg erecta 

NIM1. the wildtype gene, conferring disease res.stance to the p lan t 

^ mUtant all6le ° f N,M1 ' COnfer -9 disease susceptibility to the plant 

mrrtV. mutant plant line 

ORF: open reading frame 

PCs: Primer combinations 
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SA: salicylic acid 

SAR. Systemic Acquired Resistance 

SSLP: Simple Sequence Length Polymorphism 

Ws-O A rabtdopsis ecotype Wassilewskija 

YAC: Yeast Artificial Chromosome 



The NIM1 gene has been cloned by mapping and walking techniques which indicate 
that the gene is contained in a -105 Kb region. (See Figure 13 and Table 16). This region 
is delineated by the L84.6b marker on the left and the L84.T2 marker on the right. Only 
three overlapping cosmids made from wild-type DNA from the 105 Kb region complement 
the niml mutant phenotype (Figure 13 and Table 16). These three cosmids only overlap in 
a 9.9 Kb region defined by the left end of cosmid clone D7 and the right end of cosmid D5 
as pictured in Figure 13. Many other cosmids made to other areas of the 105 Kb region do 
not complement the niml phenotype (Figure 13 and Table 16). A near full length cDNA 
clone to the NIMi gene indicates the appropriate intron-exon borders and defines the amino 
acid sequence of the gene product. Only the NIM1 gene region within the 9.9 Kb 
complementing region has sequence changes in various niml mutant alleles (Table 18). 
Three other potential gene regions showed no sequence changes that are assosciated with 
the niml phenotype. The sequence changes found in the NIM1 gene region are consistent 
with altered function or loss of function of the gene product. The severity of the change to 
the NIM1 gene region in a particular mutant allele is roughly correlated with the observed 
physiological seventy of that niml allele. Only the NIM1 gene region had detectable RNA 
(transcription) and this RNA showed abundant changes consistent with the physiolog.cal 
ro ( e of NIM1 in pathogenesis (Table 18 and Figure 16). 
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is 

NIM1 
in 



The present invention relates to an isolated gens fragment. the NtM , gene whjch 
a Key component of the systemic acquired resistance (SAR) pathway ,n p ,an,s. The 
gene ,s assocated w,.h the acivation o. SAR by chemical and biological inducers and 
conjunction with such inducers. ,s required for SAR and SAR gene express.on. 

The location o, the MA, gene ,s determined by molecular biologica, ana,ys,s o, the 

extreme sensit.vity to a w.de variety o, pathogens and renders them unable to respond to 

pathogens and chemical inducers of SAR. 

N, m , mutants are useful as 'universal disease susceptibfe- ( UDS) plants by virtue of 
•heir being susceptibfe ,o many strains and pathotypes o, pathogens o, the host plan, and 
atso to pathogens which do no, normal.y infec, the host p,ant. bu, which infec, other hos,s 
They can be generated by ,rea„ng seeds or other b,o,o 9 ,ca, materia, with mutagen, agents 
and then selecting progeny p la n,s tor ,he UDS pheno, y pe by .reating progeny P ,an,s 
Known chemical inducers (e.g. INA) o, the sys.emic acquired response and ,hen ,n f ec, 
•he plants w,h a Known pathogen. Noninducible mutants deve,op severe disease symptoms 
under these circumstances, whereas non-mutants are induced by the chem,ca, compound 
-c system, acquired resistance. mm mu.an.s can be egua.ly se.eced trom mutan, 
PO PU ,a„ons generated by chemica, and ,rrad,a«,on mutagenesis, as we,, as ,rom popu,a„ons 
generated by T-DNA insertion and .ransposon-induced mutagenesis. 

Techniques ,o, generating mutan, plan, lines are well known ,„ tha an The mm 

plant phenotype ,s used as a loo, to identify an 

' aen '"y an isolated gene iragment which allows 

exp,ess,on ol broad spectrum disease resistance in plants 

Comprised by the present invention ,. an isolated DNA molecule comprising a mutan, gene 
of the NIM1 gene which is a niml gene. 



with 

ing 
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Following the use of a niml mutant or plant to isolate the wild-type NIM1 gene 
necessary for constitutive expression of SAR genes, the resistance trait, in combination with 
other cnaractenstics important for production and quality, can be incorporated into plant 
lines through breeding. Breeding approaches and techniques are known in the art. See, (or 
example, Welsh J. R. ( Fundamentals of Plant Genetics and Breeding . John Wiley & Sons, 
NY (1981); Crop Breeding, Wood D. R. (Ed.) American Society of Agronomy Madison, 
Wisconsin (1983); Mayo O., The Theory of Plant Breeding . Second Edition, Clarendon 
Press, Oxford (1987); Singh, D.P., Breeding for Resistance to Diseases and Insect Pests . 
Spnnger-Verlag, NY (1986); and Wricke and Weber, Quantitative Genetics and Selection 
Plant Breeding , Walter de Gruyter and Co., Berlin 1986). 

A further object of the invention is a chimeric gene comprising a promotor active in plant 
operably linked to a heterologuous DNA molecule encoding the aminoacid sequence of a 
NIM1 gene product and variants thereof according to the invention. 

Methodologies for the construction of plant expression cassettes as well as the 
introduction of foreign DNA into plants is generally described in the art. Generally, for the 
introduction of foreign DNA into plants, Ti plasmid vectors have been utilized for the delivery 
of foreign DNA. Also utilized for such delivery have been direct DNA uptake, liposomes : 
electroporation, micro-injection, and microprojectiles. Such methods had been published in 
the art. See, for example, Bilang et al. (1991) Gene 100: 247-250; Scheid et al.. (1991) Moi. 
Gen, Genet. 228: 104-1 12; Guerche et al., (1987) Plant Science 52: 1 1 1 116; Neuhause et 
al., (1987) Theor. Appl. Genet. 75: 30-36; Klein et al., (1987) Nature 327: 70-73; Howell et 
al., (1980) Science 208:1265: Horsch et al., (1985) Science 227: 1229-1231; DeBlock et al., 
(1989) Plant Physiology 91 : 694-701 ; Methods for Plant Mo lecular Biology (Weissbach and 
Weissbach, eds.) Academic Press, Inc. (1988); and Meth ods in Plant Molecular Biology 
(Schuler and Zielinski, eds.) Academic Press. Inc. (1989) See also US. patent application 
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S.n., Nos 08M3B.666 Med May ,0, , 995 . an, WO 93,0 7278 . Soth „, whlch afe 
mcorporated here.n by reterence ,„ ,ne,r entirety. „ un<Jers , Md ^ ^ ^ 
.ransformat.on will depend upon the plan, cell ,o be transformed. 

« <ur.her recognized ,ha, the component „, , he express.on ca S se,,e may be 
mod.t.ed ,o .ncrease expression For example, uuncated seguences. „u C leo„de 
subshtut.ons or o,he, mod„,ca„ons may be employed. Plan, cells .reformed w„h sucn 
mod„,ed express,on systems, men. would exhibit overexpress.on o, cons,„u„ve express.on 
of SAR genes necessary for activat.on of SAR. 

The DNA molecule or gene Iragmen, conlerr.ng d.sease resistance ,. p,a„,s by 
aNowing induction o, SAR oene express.on can be incorporated in P ,a„, o, oacter.al cells 
us,n 9 convent.ona, recomb.nan. DNA technology Generally. ,h,s .nvolves msen.ng ,„e DNA 
-ecule ,n,o an express.on sys.em ,o wh.ch the DNA molecule Is heterologous „ e no, 
normally presen,, The heterologous ONA molecule Is Inserted into ,e express.on system 
or vector ,n proper or,en,a„on and correct read.ng trame. The vector conta.ns the necessary 
elements to, the ,ranscr,p„o„ and translate o, the inserted prote.n-cod.ng seguences A 
■arge number o, vec.o, systems Known ,n the ar, can be used, such as plasm.ds 
bactenopnage v.ruses and other mod.Hed v.ruses. Suitable veclors mclude. bu, are no, 
«o. vira, vectors such as lambda vector systems Ig.n. Igtlo and Charon <; plasmid 
vectors such as p B , 12 ,. pBR322 , pACYC ,„ pACYC1 ^ par series P KK 22 3-3 

s.m,iar systems. The ONA seguences can be cloned into the vector us.ng standard cloning 

procedures in the art. asdescribedbyMan.at.se. a . M„,» , 

y "an.alis et a... Molecular Cloning: A I ahn, a in , v 

Manual, Cold Spring Laboratory. Cold Spnng Harbor. New York „98 2 , 

A further o b ,ec, o, the ,nven„on a recomb.nan, vector compr^g , he ch.mer.c gene 

according to the invention. 
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In order to obtain efficient expression of the gene or gene fragment of the present 
invention, a promoter must be present in the expression vector. RNA polymerase normally 
binds to the promoter and initiates transcription of a gene. Promoters vary in their strength, 
i.e., ability to promote transcription. Depending upon the host cell system utilized, any one 
of a number of suitable promoters can be used. Suitable promoters include ubiquitin, nos 
promoter, the small subunit ribulose bisphosphate carboxylase gene promoter, the small 
subunit chlorophyll A/B binding polypeptide promoter, the 35S promoter of cauliflower 
mosaic virus, and promoters isolated from plant genes. See C.E. Vallejos. et al. t 
"Localization in the Tomato Genome of DNA Restriction Fragments Containing Sequences 
Homologous to the rRNA (45S), the major chlorophyll ^Binding Polypeptide and the 
Ribulose Bisphosphate Carboxylase Genes," Genetics 1 12: 93-105 (1986), which discloses 
the small subunit materials. The nos promoter and the 35S promoter of cauliflower mosaic 
virus are well known in the art. 

Once the disease resistance gene of the present invention has been cloned into an 
expression system, it is ready to be transformed into a plant cell. Plant tissues suitable for 
transformation include leaf tissues, root tissues, meristems, and protoplasts. 

Bacteria from the genus Agrobactenum can be utilized to transform plant cells. 
Suitable species of such bacterium include Agrobacterium tumefaciens and Agrobactenum 
rhtzogens. Agrobacterium tumefaciens (e.g.. strains LBA4404 or EHA105) is particularly 
useful due to its well-known ability to transform plants. 

Another approach to transforming plant cells with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. 
Patent Nos. 4,945,050; 5,036,006, and 5,100.792 all to Sanford et al. Generally, this 
procedure involves propelling inert or biologically active particles at the cells under 
conditions effective to penetrate the outer surface of the cell and afford incorporation within 
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the ,„,er,or thereof. When , ne „ pan , c|es afe otiii2ed me vectof ^ ^ ^ 

CI by coating the particles w,,h ,he vector con,a,„,„ g lnc des.red gene . A1(erna „ ve|y „, 
targe, eel, can oe surrounded by , h . veco, so tha, the vecor Is carried i„,o the ce„ by the 
waKe o, the particle. Bio,og,ca„y acttve particles (e.g.. dr,ed yeas, cel,s. dried bacenun, or a 
bacier.ophage. each containing DNA sough, ,o be introduced, can a,so be propelled ,„,o 



plant cell tissue. 



The .so.ated gen. .ragman, of ,he presan, ,nven„on can be utilized ,o confer 
d.sease res.stance ,o a w,de var.ety „ p,an, celis. inctuding those ., gymnosperms 
monocots. and d,co,s. Although ,he gena can be ,nser,ed i„,o any p,an, cel, fa,„ng w.fbin 
,hese broad classes. „ „ particularty usetu, ,n crop pian, ceils, such as r,ce. wheat, bariey 

cau„„ower. broccoli. ,urn,p. rad,sh. spinach, asparagus. on,o„, gar„c. eggplan,. pepper 
ce,ery. carrot, sguash. pumpKin. 2UCC hin, cucumber, appie. pea, q u,nce. meion plum 
=".rry. Peach, nectarine, apneot. strawberry, grape, raspberry, blacKberry. p,„eap P ,e 
avocado, papaya, mango, banana, soybean, .obacco. tomato, sorghum and sugarcane 

The express.on system o, the present ,„ven„o„ can be used to transform virtuaity 
any crop pian, cei, under su.table conditions. Transformed oeils can be regenerated into 
whole ptants such tha, the gene imparts disease resistance to the ,n,ac, transgenic plants 
As se, torth above, the expression system can be modified so tha, the disease res.stance 
gene is continuously or const.tut.vely expressed. 



Transformation 

The present system can be utilized in an, 

tion ana regenerate are well known m the art. 
ices, see also, An, G . Watson, B.D., and Ch,ang, C.C. 



iy plant which can be transformed and 

regenerated. Such methods for transformat. 
As well as the above cited referenc 



SUBSTITUTE SHEET (RULE 26) 



WO 97/49822 



PCT/EP97/0I218 



- 13 - 

Transformation of tobacco, tomato, potato, and Arabidopsis thaliana using a binary Tt 
vector system. Plant Physiol. 81:301-305, 1986; Fry, J,, Barnason, A., and Horsch, R.B. 
Transformation of Brassica napus with Agrobactenum tumefaciens based vectors. PI. Cell 
Rep, 6:321-325, 1987; Block, M.d. Genotype independent leaf disc transformation of potato 
(Solanum tuberosum) using Agrobactenum tumefaciens. Theor.appLgenet. 76:767-774, 
1988; Deblock, M., Brouwer, D.D., and Tennmg, P. Transformation of Brassica napus and 
Brassica oieracea using Agrobactenum tumefaciens and the Expression of the bar and neo 
genes in the transgenic plants. Plant Physiol. 91 :694-701, 1989; Baribauit, T.J., Skene, 
K.G.M., Cain, P. A., and Scott, N.S. Transgenic grapevines: regeneration of shoots 
expressing beta-glucuronidase. PI. Cell Rep. 41:1045-1049, 1990; Hinchee, M.A.W.. Newell, 
C.A.. ConnorWard, D.V., Armstrong, T.A., Deaton, W.R., Sato, S.S., and Rozman, R.J. 
Transformation and regeneration of non-solanaceous crop plants. Stadler. Genet. Symp. 
203212.203-212, 1990; Barfieid, D.G. and Pua, E.C. Gene transfer in plants of Brassica 
juncea using Agrobactenum tumefaciens-mediated transformation. PI. Cell Rep. 10:308-314, 
1991; Cousins, Y.L., Lyon, B.R., and Llewellyn, D.J. Transformation of an Australian cotton 
cultivar: prospects for cotton improvement through genetic engineering. Aust. J. Plant 
Physiol 18 481 -494 1991: Chee, P.P. and Siightom. J.L. Transformation of Cucumber 
Tissues by Microprojectile Bombardment Identification of Plants Containing Functional and 
Nonfunctional Transferred Genes. GENE 1 18:255-260. 1992; Christou, P., Ford. T.L.. and 
Kofron . M. The development of a variety-independent gene-transfer method for nee. 
Trends. Biotechnol. 10:239-246, 1992; D'Halluin, K , Bossut, M., Bonne, E. ( Mazur, B , 
Leemans, J., and Botterman, J. Transformation of sugarbeet (Beta vulgaris L ) and 
evaluation of herbicide resistance in transgenic plants. Bto/Technol. 10:309-314 1992; Dhir, 
S.K.. Dhir. S.. Savka. M.A., Belanger, F., Kriz. A.L.. Farrand. S.K.. and Widholm. J.M. 
Regeneration of Transgenic Soybean (Glycine Max) Plants from Electroporated Protoplasts 
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PLANT PHYSIOL 99:81-88, 1992: Ha. S.B.. Wu. F.S.. and Thome. T. K. Transgenic turf- 
type tall fescue (Festuca arundinacea Schreb.) plants regenerated from protoplasts. PI. Cell 
Rep. n:601 -604. 1992: Blechl, A.E. Genet.c Transformation The New Tool for Wheat 
improvement 78th Annual Meet.ng Keynote Address. CEREAL FOOD WORLD 38:846-847. 
1993; Casas. A.M., Kononowicz. A.K., Zehr. U.B., Tomes. D.T.. Axtell, J.D.. Butler. L.G., 
Bressan, R.A.. and Hasegawa, P.M. Transgenic Sorghum Plants via Microprojectile 
Bombardment. PROC NAT ACAD SC, USA 90:1 12I2A 1216, 1993; Christou, P. Philosoohy 
and Pract.ce of Vanety Independent Gene Transfer into Recalcitrant Croos. IN VITRO CELL 
DEV BIOL-PLANT 29P: 119-124, 1993; Damiani. P., Nenz. E„ Paolocc, P.. and Arcion,, S. 
introduction of Hygromycin Resistance in Lotus spp Through Agrobacterium Rhizogenes 
Transformation. TRANSGENIC RES 2:330-335, 1993; Davies. D R., Hamilton, J„ and 
Mullineaux. P. Transformation of Peas. PLCs,, R ep . 12:180-183. 1993; Dong, J.Z. and 
Mchughen. A. Transgenic F.ax P,ants from Agrobacterium Mediated Transformation 
incidence of Chimeric Regenerants and Inheritance of Transgenic Plants. PLANT SCI 
91:139-148. 1993; Fitch. M.M.M., Manshardt. R.M., Gonsa.ves, D., and Slighter,. J.L. 
Transgenic Papaya Plants from Agrobacterium Mediated Transformation of Somatic 
Embryos. Pl.Cel, Rep. 12:245-249, 1993; Frank„n. C.I. and Trieu, T.N. Transformation of 
the Forage Grass Caucasian Bluestem via Biolistic Bombardment Mediated DNA Transfer. 
PLANT PHYS,OL 102:167, 1993; Golovkin. M.V . Abraham, M.. Morocz, S.. Bottka, S.. 
Feher. A., and Dudits, D. Production of Transgenic Maize P.ants by Direct DNA Uptake ,r,o 
Embryogemc Protoplasts. PLANT SCI 90:41-52, 1993; Guo. G.Q., Xu. Z.H., We,, Z.M.. and 
Chen. H.M. Transgenic P.ants Obtained from Wheat Protoplasts Transformed by Peg 
Mediated Direct Gene Transfer. CHIN SC, BULL 38:2072-2078. 1993; Asano. Y. and Ugaki. 
M. Transgenic plants of Agrost.s alba obtained by electroporationmediated direct gene 
transfer into protoplasts. PLCell Rep. 13, 1994; Ayres. N.M. and Park, W D. Genetic 
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Transformation of Rice. CRIT REV PLANT SCI 13:219-239, 1994; Barceio, P., Hagel, C, 
Becker, D., Martin, A., and Lorz. H. Transgenic Cereal (Tritordeum) Plants Obtained at High 
Efficiency by Microprojectile Bombardment of Inflorescence Tissue. PLANT J 5:583-592. 
1994; Becker, D. T Brettschneider, R., and Lorz, H. Fertile Transgenic Wheat from 
Microprojectile Bombardment of Scutellar Tissue. PLANT J 5:299-307, 1994; Biswas, 
G.C.G., Iglesias, V.A.. Datta, S.K., and Potrykus, I. Transgenic Indica Rice (Oryza Sativa L) 
Plants Obtained by Direct Gene Transfer to Protoplasts. J BIOTECHNOL 32: 1-10, 1 994; 
Borkowska, M., Kleczkowski, K., Klos, B., Jakubiec, J., and Wielgat, B. Transformation of 
Diploid Potato with an Agrobacterium Tumefaciens Binary Vector System .1. Methodological 
Approach, ACTA PHYSIOL PLANT 1 6:225-230, 1994; Brar, G.S., Cohen, B.A., Vick, C.L., 
and Johnson, G.W. Recovery of Transgenic Peanut (Arachis Hypogaea L) Plants from Elite 
Cuitivars Utilizing Accell(R) Technology. PLANT J 5:745-753, 1994; Christou, P. Genetic 
Engineering of Crop Legumes and Cereals Current Status and Recent Advances. AGRO 
POOD (NO HI TECH 5: 17-27, 1994; Chupeau, M.C., Pautot, V. f and Chupeau, Y. Recovery 
of Transgenic Trees After Electroporation of Poplar Protoplasts. TRANSGENIC RES 3: 13- 
19, 1994; Eapen, S. and George, L. Agrobacterium Tumefaciens Mediated Gene Transfer 
m Peanut (Arachis Hypogaea L). PI Cell Rep 13 582-586, 1994; Hartman, C.L.. Lee, L.. 
Day, P R., and Turner, N.E. Herbicide Resistant Turfgrass (Agrostis Palustris Huds) by 
Biohstic Transformation. BID-TECHNOLOGY 1 2:91 9923. 1994; Howe, G.T., Goldfarb, B., 
and Strauss. S.H. Agrobacterium Mediated Transformation of Hybrid Poplar Suspension 
Cultures and Regeneration of Transformed Plants. Plant Cell Tissue & Organ Culture 
36:59-71, 1 994; Konwar, B.K. Agrobacterium Tumefaciens Mediated Genetic 
Transformation of Sugar Beet (Beta Vulgaris L) J PLANTBIOCHEM BIOTECHNOL 3:37- 
41, 1994; Ritala. A., Aspegren, K., Kurten, U.. Salmenkalliomarttila. M., Mannonen, L., 
Hannus, R., Kauppinen, V.. Teen, T.H., and Enan, f.M Fertile Transgenic Barley by 
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Part,:e Bombay, 0( Immature Embryos. PLANT MOL BIOL 24:3,7-325. ,994: Scorza. 
a. C -..„s. J.M.. Ramm.ng. o.w.. a « Emershad. R L. Transformation of Grape (Vit.s 
V.n.fera L, Soma„c Embryos and Regenerate o, Transgenic Plants j cm b/ochem 

:102, 1994; Shimamoto, K Gene Fynroe^i^r, -r 

o. bene Expression in Transgenic Monocots CURR 

OMOTECHNOL*,*.™. ,994; Spangenberg. G.. Wang. Z v.. Nagel , and 
PotryKus. ,. Protoplast Culture and Generation „, Transgenic Piants ,n Red Fescue 
(Festuca Rubra u. PWA/r SC, 97:83-94. ,994; Spangenberg, G.. Wang, z V Nage, J 
and Po,„ k us. ,. Gene Transfer and Regeneration o, Transgenic Piants ,„ Forage Grasses 
J CELL B,OCHEM,02. , 994: wa , Y . c . ana Lemau ^ p g Generaiion o( uige Num6flrs 

of independent Transformed Fertile Barley Plants. PLANT PHYSIOL ,04:3748. ,994- 
WeeKs. J.T.. Anderson. O.D.. and Biechi. A.E. stabfe Transformed o, Wheat ,!„„« 
Aestivum L, by Microb,o,ec,i,e Bonnbardment. j CELL BIOCHEM :,04. ,994 Ye X J 
Brown. S.K.. Sccza. R.. Cordts. J., and Sanford. J.c. Gene„c Transformed o, Peach 
T.ssues by Par„c,e Bombardment. JAMER SOCHOR TSC, ,, 9:367-373 ,994 
Spangenberg. G., Wang. Z.V.. Nage,. J., and PotryKus. I. PROTOPLAST CULTURE AND 
GENERATION OF TRANSGENIC PLANTS IN RED FESCUE (FESTUCA RUBRA L,. Plsn, 
Science 1994 97:83-94, 1995. 

As ni m1 nos, plants may also be susceptible to pathogens outs.de of the hos,-range 
mto wh,ch they normaliy ,a„. these plants a,so have sign.f.can, u,„„v , n the molecular 
genetic, and biological study o, host-pathogen interacons. Furthermore the UDS 
pnenotype o, n, m , plants afso renders them o, u ,„„y (or fung.ode screen.ng. nim , mutants 
selected ,„ a part.cular host have cons.derabie u,„„y tor the sc,een,n g o, ,ung,c,des us,ng 
•h.. bos, and pathogens o, the host The advantage „es ,n the UDS Phenoytpe o, the 
mutant. v-h lc h circumvents the problems encountered due to hosts .e.ng d„,eren„a„y 
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susceptible to different pathogens and pathotypes, or even resistant to some pathogens or 
pathotypes. 

Pathogens of the invention include but are not limited to viruses or viroids, e.g. 
tobacco or cucumber mosaic virus, ringspot virus or necrosis virus, pelargonium leaf curl 
virus, red clover mottle virus, tomato bushy stunt virus, and like viruses; fungi, e.g. 
Phythophthora parasitica and Peronospora tabacina: bacteria, e.g. Pseudomonas syringae 
and Pseudomonas tabacf, insects such as aphids, e.g. Myzus persicae; and lepidoptera, 
e.g., Hetiothus spp.; and nematodes, e.g., Meloidogyne incognita. The methods of the 
invention are useful against a number of disease organisms of maize including but not 
limited to downy mildews such as Scleropthora macrospora, Scierophthora rayissiae, 
Sclerospora gramin/cola, Peronosclerospora sorghi, Peronosclerospora phiiipptnensis, 
Peronoscierospora sacchah and Peronosclerospora maydis: rusts such as Puccinia sorphi, 
Puccinta polysora and Physopelia zeae; other fungi such as Cercospora zeae-maydis, 
Cofletotnchum grammicola, Fusarium monoliforme, Gibberella zeae. Exserohilum turcicum, 
Kabatteilu zeae and Bipotar/s maydis: and bacteria such as Erwinia stewartii. 

DESCRIPTION OF TH E SEQUE NCE LISTING 

SEQ ID NO 1 - 9919-bp genomic sequence from Figure 14. 

SEQ ID NO:2 - 5655-bp genomic sequence from Figure 15. 

SEQ ID NO:3 - AA sequence of wild-type NIM protein encoded by cds of seq id 

no:2. 

SEQ ID NO:4 - Rice-1 AA sequence 33-155 from Figure 19. 
SEQ ID NO:5 * Rice-1 AA sequence 215-328 from Figure 19. 
SEQ ID NO:6 - Rice-2 AA sequence 33-155 from Figure 19. 
SEQ ID NO:7 - Rice-2 AA sequence 208-288 from Figure 19 



SUBSTITUTE SHEET (RULE 26) 



WO 97/49822 



PCT/EP97/01218 



18 



SEO IP no:8 - ri.ce-3 AA sequence 33-155 from Figure 19. 
SEQ ID NO:9 - R,ce-3 AA sequence 208-288 from Figure 19. 
SEQ ID NO:10 - R.ce-4 AA sequence 33-155 from Figure 19. 
SEQ ID NO.1 1 - R,ce-4 AA sequence 215-271 from Figure 19. 

DEPOSITS 

The .cowing vector molecules have been depose with Amertcan Type Cutture 
CoUectlon ,230, ParMawn Dnve Rockv.tte. MD 20852 U.S.A. on the dates indicated 

below: 

Plasm.d BAC-04 was depos.ted with ATCC on May 8. 1 996 as ATCC 97543. 
Plasm.d P1-18 was deposited w.th ATCC on June 13, 1996 as ATCC 97606. 
Cosm.d D7 was depos.ted with ATCC on September 25, 1996 as ATC 977 36 . 

EXAMP1.FS 

Example 1 

lden„, y , n g MM, Cones by map-based Con.ng. High resotution genetic mapping 
and physical mapping of NIM1 in Arabidopsis. 

1. Plant Material and Isolation of niml Mutants. 

Mn, > mutants were ,so,a,ed trom two Arab.dopsis ecotype Ws-O ptan, populations, 
as described by Deianey e, a,.. (,995) PNAS 92. 6602-6606. One mutan, popu.a,,on was ,„ 
the tern, o, an M2 l.brary denved from ethy, methane suttona.e (EM8 , rnutagen.zed seeds 
(purchased trom Lehte, Round Rock. TX, and the other was in the torn, c. a T-0NA 
popu,a„o„ derived „om seed obtained from the Ohio State University Arabtdopsis Biologica, 
Resource Center (Columbus, OH). 
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The basis of the screen for noninducible immunity (n/ml) mutants was tc screen 
mutagenized plant populations for plants in which resistance to a virulent pathogen could 
not be induced by INA (2,6 dichloro isonicotinic acid; Metraux, et a!., 1991 . In: Advances in 
Molecular Genetics of Plant-Microbe Interactions. Vol .1 , 432-439. Hennecke and Verma, 
eds.; Kessmann et al. 1993 In: Mode of action of agrochemicals. Y Honma, ed.; Vernooij et 
al. 1995, Molec. PL Microbe Interaction 8, 228-234). 

Plants from the mutant populations were grown at high density in large trays in 
commercial planting mix. When the plants were 2 weeks of age ; the trays were sprayed 
with 0.25 mg/ml INA. Four days later, the plants were sprayed with a spore suspension of 
Peronospora parasitica, isolate EmWa (EmWa), at 5x1 0 4 to 1x10 5 spores/ml. This fungus 
is normally virulent on the Arabidopsis Ws-O ecotype, unless resistance is first induced in 
these plants with INA or a similar compound. 

Following incubation in a high humidity environment, plants with visible disease 
symptoms were identified, typically 7 days after the infection. These plants did not show 
resistance to the fungus, despite the application of the resistance-inducing chemical and 
were thus potential nim (non-immunity) mutant plants. From 360,000 plants, 75 potential 
mm mutants were identified. 

These potential mutant plants were isolated from the flat, placed under low humidity 
conditions and allowed to set seed. Plants derived from this seed were screened in an 
identical manner for susceptibility to the fungus EmWa, again after pretreatmem with INA. 
The progeny plants that showed infection symptoms were defined as nim mutants. Six mm 
lines were thus identified. One line (mm1) was isolated from the T-DNA population and five 
from the EMS population. 
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2. Sec 



Reactions to IN A and Other Chemical Inducers of D.sease 
Resistance. 

i. Phenotypic analysis of ntml. 

Salicylic acid (SA) and benzo(1 ,2.3)thiadiazole-7-carbothioic acid S-methy. ester 
(BTH) are two chemicals that, like INA. induce broad spectrum d.sease resistance, termed 
Systemic Acquired Resistance (SAR). in w.ldtype plants. Since INA did not induce 
resistance ,n the n,m1 plants, these plants were a.so evaluated for the.r disease resistance 
response following pretreatment with SA and BTH (as partly described in De.aney et al, 
1995, PNAS 92,6602-6606). 

Plants were sprayed with 1 , 5, or 15 mM SA or 0.25 mg/m, BTH and challenge 
inoculated w,th EmWa 5 days later (as described in example 1 above). Both SA and BTH 
failed to protect niml plants from fungal infection, as evidenced by the presence of disease 
symptoms and fungal growth on these plants. Thus, the niml plants were not responsive to 
any of the SAP-inducing chemicals, .mp.ying that the mutation was downstream of the entry 
point(s) for these chemicals in the resistance induction pathway. 

Niml was also evaluated for ,ts disease response to infection with 2 incompatible P. 
parasitica isolates, We.a and Noco (i.e. these funga, strains do not cause disease on 
w,ldtype Ws-O plants), niml p (an ,s were sprayed with con.dia. suspensions of 5-10x10* 
spores/m, of Wela or Noco and .ncubated under high hum,d,ty for 7 days. Un„ ke w.ldtype 
Plants, nim, plants developed d.sease symptoms ,n response to both Wela and Noco 
■ nfection, The symptoms were necrotic flecking and trailing. w„h some sporu.at.on. 
Fo„ow,ng lactopheno. blue staining, funga. hyphae were easily observed ,n the leaves of 
nimt plants. Thus, the mm1 plants are susceptible to normally incompatible P paraS itica 
isolates. Th,s result shows that the n,m, p.ants are not only defective in chemically induced 
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disease resistance, but are also defective in natural resistance to microorganisms that are 
normally not pathogenic. 

ii. Biochemical analysis of nimh 

SA, INA and BTH induce SAR and expression of the SAR genes, which include the 
Pathogenesis Related genes PR-1, PR-2 and PR-5 in Arabidopsis. Since these 
compounds did not induce disease resistance in ntml (as described in example 1 2 above), 
this mutant line was analyzed for SAR gene expression following SA, INA or BTH treatment. 

After treatment of ntml plants with SA, INA or BTH r plant tissue was harvested and 
analyzed for accumulation of RNA from the PR-1 , PR-2 and PR-5 genes. To this end. total 
RNA was isolated from the treated tissues and electrophoresed on an agarose gel. 
Triplicate gel biots were prepared and each was hybridized with a probe for one of these 3 
SAR genes as described in Deianey et at, 1995, PNAS 92, 6602-6606. In contrast to the 
case in wildtype plants, the chemicals did not induce RNA accumulation from any of these 3 
SAR genes in nim1 plants, as shown in figure 1. Taken together, the results indicate that 
the chemicals induce neither SAR nor SAR gene expression in nim1 plants. 

Since the chemicals did not induce SAR, or SAR gene expression in nim1 plants, it 
was of interest to investigate whether pathogen infection could induce SAR gene 
expression in these plants, as it does in wildtype plants. Ws-O and nim1 plants were 
sprayed with EmWa spores as described and tissue collected for RNA analysis at several 
hmepoints. Pathogen infection (EmWa) of wildtype Ws-O plants induced PR-1 gene 
expression within 4 days after infection, as shown in figure 2. In nim1 plants, however, PR- 
1 gene expression is not induced until 6 days after infection and the level is reduced relative 
to the wildtype at that time. Thus, following pathogen infection, PR-1 gene expression in 
ntml plants is delayed and reduced relative to the wildtype 
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infection of wildtype plants with pathogens that cause a necrotic reaction leads to 
accumulation of SA in the infected tissues. It has been shown that this endogenous SA is 
required for S1 gnal transduction ,n the SAR pathway, i.e.. breakdown of the endogenous SA 
leads to a decrease in disease resistance. This def.nes SA accumulation as a marker ,n the 
SAR pathway (Gaffney et al, 1993, Science 261, 754-756). 

Nim1 plants were tested for their ability to accumulate SA following pathogen 
mfect.on. Pseudomonas synngae tomato stra,n DC 3000, carrying the avrR P t2 gene, was 
injected into leaves of 4-week-old n,mi plants. The leaves were harvested 2 days later for 
SA analysis as described by Delaney et al, 1995, PNAS 92, 6602-6606. This analysis 
showed that the n.ml plants accumulated h.gh levels of SA in infected leaves, as shown in 
F.gure 3. Uninfected leaves also accumulated SA, but not to the same levels as the 
.nfected leaves. Similar to what has been observed in wild-type AraMopsis. This indicated 
that the nim mutation maps downstream of the SA marker in the signal transduction 
pathway. Th,s was anticipated, as INA and BTH (inactive in niml plants) are known to 
stimulate a component in the SAR pathway downstream of SA (Vernoo.j et al.. 1995. Molec 
PL Mcrobe interaction 8. 228-234; Friedrich, et al.. 1 996, The PJant Journal 9. in press; and 
Lawton, et a,., 1996. Jhe PJant Journal 9. in press). In add.tion, as described in Example 
1.2. exogenously applied SA did not protect niml from EmWa infection. 

3. Genetic analysis of niml. 

N,m1 plants were backcrossed to wild-type Ws-O plants, and F1 progeny were 
tested for resistance to EmWa after in A pretreatment. as described in Example 1.1 above. 
None of the INA-pretreated F1 plants had infection symptoms, whereas the niml control 
plants d,d show infection. Hence, the niml mutation was determined to be recess.ve. 
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The F2 population from the Ws-O x nim1 cross was also assayed for its disease 
resistance after INA pretreatment. Of this population, approximately 1/4 (32/130 plants) 
showed disease symptoms after ErrtWa treatment of INA pre-sprayed plants and 3/4 
(98/130 plants) showed no disease. These results indicate that the mm mutation identifies 
a single genetic locus and corroborates the F1 data that show the recessive nature of the 
mutation. 

4. Identification of markers in and genetic mapping of the NIM locus. 

For conventional map-based cloning of the NIM gene, markers had to be identified 
that were genetically closely linked to the mutation. This was accomplished in 2 steps. First, 
the niml plants were crossed to a different Arabidopsis genotype, Landsberg erecta (Ler), 
and F2 plants from this cross which had a niml phenotype (i.e. plants that are homozygous 
nim/nim at the NIM iocus) were identified. From these, plants that had a Ler genotype at a 
nearby DNA marker were identified by molecular analysis. These plants, by virtue of the 
identification criterion, are recombinant between the marker and the NIMlocus. The 
frequency of recombinants defines the genetic distance between the marker and the NIM 
locus. 

The second prerequisite for map based cloning is that markers are identified that are 
genetically very close to the NIM locus, i.e. markers that identify very few recombinants. If 
genetic markers are identified that are very close, than these can be used to isolate 
genomic DNA clones that are close to the NIM locus. The NIM locus can then be cloned by 
walking, if not already present on the cloned DNA. Walking can be initiated from both sides 
of the gene. It relies on obtaining overlapping clones that are successively closer to the 
gene of interest. When a single DNA marker is obtained from a walk initiated from, for 
instance, the North end and it identifies no recombinants between this marker and the gene 
of interest, it must be very close to the gene. However, if the marker does identify 
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recomomanus) from the South end. the clone from which the marker was obtained mus, 
have crossed the gene. By definition then, the gene of interest ,s cloned. ,t must be located 
between .his marker and the last North-end marker that -denies the least number of 
recombinants from the North end. 

in a f,rst step, a ,arge number of recombinants are generated by genetic crossing. In 
a second step, recombinants that are close to the NiM gene are identified with the use of 
molecular markers. Many markers have been described ,n the literature and severa, 
methods exist to develop add.t.ona. markers. Our approach has relied on a number of 
marker systems, including SSLPs and AFLPs (see below) 

i Genetic crosses. 

In order to map the chromosomal position of the MM gene relative tc the SSLP and 
AFLP markers, niml was crossed to Ler to make a mapping populate. F2 plants from th.s 
cross were grown and .eaves harvested for future DNA extractions. Next, the F2 plants 
were scored for the n,m1 phenotype. as described ,n example , above. Also. F3 
populations derived from ,nd,v,dual F2 plants were grown and scored for the nim 
Phenotype. DNA was extracted from the stored t.ssue of nim 1 phenotype F2 and F3 plants 
by the CTab method, as described (Rogers and Bendich. 1988, Plant Mo.ecular Biology 
Manual. A6, 1-10). This DNA was used for mapping the N,M gene, as described be, 



low. 



ii. Simple Sequence Length Polymorphism markers. 

The Simple Sequence Length Polymorphism (SSLP) markers ATHGENEA and 
ngam have been described (Bel, and Ecker, 1994, Genomics 19, 137-144) Primers used 
for detection of these SSLPs are listed in Table 1 . 
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Table 1. SSLP primer sequences. 

j primer set j primer sequence (5' to 3') 

J ATHGENEA (1) j ACC ATG CAT AGC TTA AAC 

jl TTC TTG 



i 


ACA TAA CCA CAA ATA GGG 


I 


GTG C 


|| ATHGENEA (2) 


ACC ATG CAT AGC TTA AAC 




TTC TTG 


! 

il 


CCA A AT GTC AAA ATA CTC 


i 

i 


GTC j 


ngam (1) 


CTC CAG TTG GAA GCT AAA 




GGG 




TGT TTT TTA GGA CAA ATG 




GCG 


I ngal 1 1 (2) 


CTC CAG TTG GAA GCT AAA G 




TGT TTT TTA GGA CAA ATG G 



Genetic mapping of the NIM gene relative to marker ATHGENEA. 

Using the ATHGENEA (1) primers for PGR amplification of Ler genomic DNA, a 205- 
basepair (bp) band was expected, whereas with Ws-O genomic DNA a band of 21 1 bp was 
expected (Bell and Ecker, 1994, Genomics 19, 137-144). The amplification products proved 
to be difficult to separate on conventional agarose gels. Hence, two alternative methods 
were developed for separation and detection of these PGR fragments 

In a first method, primerset ATHGENEA (1) (Table 1) was used to amplify genomic 
DNA in the presence of 6-carboxyrhodamine-labelled UTP (dUTP-Rno, obtained from 
ABI). yielding rhodamme-labelled PCR fragments. The PCR reactions were analyzed on a 

DNA Sequencer, which detects DNA fragments with single nucleotide resolution. 
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The spec.f.c reagents were: 1xPCR buffer. 2 mM MgCl2. dNTPs each 200 mM, 2 
mM dUTP-Rl 10 . ATHGENEA (1) pnmers at 0.75 mM. 10 ng DNA and 0.75 units Tag 
polymerase ,n a 20 m. reaction volume. Amplification conditions were: 3 minutes 94«C 
fo.lowed by 35 cycles of 15 seconds at 94°C, 15 seconds at 55°C and 30 seconds at 72°C. 
These samples were analyzed on an ABI 377 DNA Sequencer, capable of detect.ng 
fluorescently , abe ,ed DNA fragments with s,ng,e nuc.eot.de (nt.) resolution. Th.s a.lowed for 
genotyp.ng the p,ant samp.es: a 205-nucleot.de DNA fragment was obtained from Ler DNA 
and a 2 1 1 -nucleotide band from Ws-O DNA. Thus. DNA fragments d.ffenng by 6 
nucleotides ,n length could be easily distinguished, allowing for easy genotyp.ng of samples 
as homozygous Ws-O. homozygous Ler and heterozygous Ws-O/Ler at the ATHGENEA 



locus. 



in order to increase the throughput of this system, a multiplexing scheme was used. 
Some DNA samples were PGR amplif.ed as described above w„h primer set ATHGENEA 
(D , whereas other samples were analyzed with pr.merset ATHGENEA (2) („ ste d ,n tab.e 2). 
■ n each case in the presence of 6-carboxyrhodamine-labeled dUTP. Pnmer set ATHGENEA 
(2) was made based on the oubl.shed sequence of ATHGENEA (Simoens et al., 1988, 
Gene 67, 1-1 1). Th.s primerset amplified a DNA fragment of 139 bp from Ler DNA and a 
145-bp band from Ws-O DNA. Amp.ificat.on reaction conditions for primerset ATHGENEA 
(2) were identical to those descr.bed for primerset ATHGENEA (1), above. 

Single react.ons us.ng pnmer set ATHGENEA (1) and s.ngle reactions using pnmer 
set ATHGENEA (2) were m,xed together before electrophoresis on the AB. 377 DNA 
Sequencer. This muUiplex.ng approach a..owed for genotyp.ng 2 samples in a s.nge iane of 
me Sequencer, one at positions 145/139 nt and one at pos.ons 21 1/205 nt. on the 
Sequencer 
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In the second method, PCR fragments were labelled by using a pnmer labelled with 
the fluorescent dye FAM-6 (6-carboxyfluorescein) (Integrated DNA Technologies, inc.). The 
forward ATHGENEA primers of the ATHGENEA (1) and (2) primer sets are identical in 
sequence (see Table 1). This primer was labeled with FAM-6 and used in a PCR 
amplification reaction with the following reagents (Perkin Elmer): 1xXL buffer, 1 mM MgCI2, 
dNTPs each at 200 mM, primers each at 0.50 mM (forward primer FAM-6 labeled), 10 ng 
genomic DNA and 0.5 units XL polymerase in a 20-ml reaction volume. The cycling 
conditions were: 3 minutes at 94°C. followed by 35 cycles of 15 seconds at 94°C, 15 
seconds at 59°C and 30 seconds at 72°C. Again, single reactions using primer set 
ATHGENEA (1) and single reactions using primer set ATHGENEA (2) were mixed together 
before electrophoresis on the ABI 377 DNA Sequencher. This multiplexing approach 
allowed for genotyping 2 samples in a single lane of the Sequencer, one at positions 
145/139 nt. and one at positions 21 1/205 nt. 

All F2 and F3 samples from nim1 phenotype plants were scored for their genotype 
at the ATHGENEA locus as described above. All samples that were heterozygous at this 
locus identified plants that were recombinant between the NIM1 locus and the ATHGENEA 
locus. In a population of 1144 F2 nim1 phenotype plants and F3 nim1 phenotype 
populations that were scored in this way, 98 were heterozygous at the ATHGENEA locus, 
giving an estimate of the genetic distance between this SSLP locus and the NIM1 locus of 
4.3 cM. This established that the NIM1 locus was on chromosome 1 , near the ATHGENEA 
marker 

Genetic mapping of the NIM1 gene relative to marker ngal 1 1 . 

Two primer sets for SSLP marker ngal 1 1 (described in Bell and Ecker, 1994, 
Genomics 19, 137-144) were used to amplify genomic DNA of F2 and F3 niml phenotype 
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P-ant. and con.ro, Ws-O and Ler plan.s Primer se, nga, , , „, (aescr , 0e[J , n 6e „ anfl 
Ecker. ,994. Genom.cs ,9. 137-144 and lis.ed in Table 1, was used under me .o.low.ng 
conation.: ,xPCR bu.ler. 2 mM MgC,2. dNTPs each a, 200 mM . pnmers a, 0 75 mM. ,0 
ng DNA and 0.75 un„ Taq polymerase in a 20 ml reaCon voiume. Pnmer se, nga, 1 1 (2) 
(l.sled in Tabie , . and a derivahve of pnmer se, nga,,, „)) was used unde , differen , 
cond.hons: ,«PCR buHer. 1.5 mM Mg c,2. dNTPs each 200 mM, pnmers a, 1 mM. ,0 ng 
DNA and , uni, Tag polymerase ,„ a 20 m, reaction volume. Both reactions were am P „„ed 
by mcubahon a, 94-C .or , m,„ u ,e. , oMow ed by 40 cyc|6s Qf , 5 ^ ^ ^ 

seconds at 55*C and 30 seconds at 72°C. 



The samp.es were analyzed on 3-5% agarose gels. The band obtained from 
am P „ ( ,ca„o„ o, Ws-0 DNA w,.h e„her pnme, se, was ,46 bp. whereas amp,i,y,ng Ler DNA 
resu„ed in a ,62-bp band Plan, samp.es ,„., were heterozygous a, ,he nga, , , ,„cus 
■denied p.an.s ,ha, were recombinan, be.ween ,h,s SSLP marker and ,he M M locus 
Among , ,44 F2 n m , pheno.ype plan.s and F3 n m l pheno.ype P opu,a„ons, 239 were 
■denied as heterozygous ,or ,he nga, , , marker, giving an estima.e ,or ,he genet.c 
d.stance be,ween me SSLP marker and ,he N/M.ocus o, ,0.4 cM. This corrobora.ed ,ha, 
me NIM1 ,ocus was on chromosome 1 Since ,ew nm, phenotype p.anis ex.s.ed ,ha, were 
he.erozygous a, bo.h ATHGENEA and nga, 1 1 . , h . MM , , ocus was a ^ 

be,ween ,„ese 2 markers, with ATHGENEA .oca.ed Nor.h 0 , , he N,M, gene ano nga , , , 
loca,ed Sou,h o. Ihe N IM , gene This placed ,he NIMt gene approx.ma.ely ,0 cM nor.h o, 
nga,, ,. near p os„ion 85 on Chromosome , «Lis,e, and Dean, ,993. Plan, J. 4. 745-750: 
Bell and Ecker. 1994, Genomics 19, 137-144) 
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in. Amplified Fragment Length Polymorphism markers. 

For map-based cloning of the NIM1 gene, it is necessary to identify molecular 
markers that are successively closer to this gene. For this purpose, Amplified Fragment 
Length Polymorphism (AFLP) markers were generated by using the selective restriction 
fragment amplification method described by Zabeau and Vos (1993, European Patent 
Application EP 0534858) and Vos et aL (1995, Nucleic Acid Research 23, 4407-4414). 

Outline of the AFLP Technology. 

The use of the AFLP technology in mapping relies on selective amplification of a set 
of DNA bands in 2 genetically distinct samples. Finding that any of the obtained bands are 
different between the 2 genotypes identifies those bands as markers for that genotype. If 
the marker cosegregates at high frequency with the gene (mutation) of interest, then the 
marker is dose to the genetic locus. 

Selective amplification of a small set of DNA fragments in a complex DNA sample is 
achieved in a 2-step process. First, DNA fragments are generated by digesting the DNA 
with restriction enzymes, followed by ligation of adapters to the ends. Second, primers 
consisting of a sequence complementary to the adapters plus a 3' extension (typically 0-3 
nucleotides) are used to amplify only those DNA fragments with ends that are 
complementary to these primers. If a single nucleotide extension is used, then theoretically, 
each primer wilt "fit" on approximately 1/4 of all fragments, with 1/16 of all fragments having 
a primer fit on ooth ends. Thus, a limited set of DNA fragments is amplified with these 
primers. By further radiolabellmg one primer, an even smaller subset of visible bands can 
be ootatned 
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AFLP analysis. 

FO, AFLP ana,ys,s of DNA sampies. 50 ng ONA was digested w„h .ha appropriate 
enzymes <usua„y EcoR, and Msei; see be.ow, and adapters ,„s,ed ,n tabie 2 beiow, were 
legated ,o the restriction fragments (usualiy EcoR. and Msel). The sequences of the prtmers 
and the VAC. P, and BAG Cones are descnbed ,„ detai, be.ow. The .emp.ates were used 
.or amplication reactions (appro.ma.e.y 0.5 ng DNA per reaction,, us.ng primers tna, were 
comp.ementary to the adapters, with short 3' extensions « or 3 nuc.eotides p„me, 
sequences are „s,ed be.ow,. S.nce one o, the primers is radioacve.y ,abe„ed <usua„y the 
E=0R, pnmer,, on,y a suPse, o, the ampi.f.ed .ragments ,s vis.b.e upon autoradiography 
analysts of the gel used to separate the bands. 
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Amplification conditions for cloned DNA (YAC, P1, cosmid) were as follows: 36 
cycles of 30 sec. at 94°C (denaturation), 30 sec annealing and 1 mm extension at 72°C. 
The annealing temperature rn the first cycle was 65 3 C and was reduced by 0.7°C in each 
cycle for the next 1 2 cycles and then kept at 56°C. For genomic DNA of Arabidopsis plants, 
the amplification was performed in 2 steps: in the first step (preampfification), the DNA was 
amplified with primers that have a single nucleotide extension (neither primer was labeled). 
Reaction conditions for this amplification reaction were: 20 cycles of 30 sec. denaturation 
(94°C). 1 mm annealing (56°C) and 1 min extension f72°C). In the second step, the first 
amplification reaction was diluted 10 fold and reamplified 36 cycles with primers containing 
the full-size extensions (using one labeled primer) under the following conditions: 30 sec. at 
94°C (denaturation), 30 sec annealing and 1 mm extension at 72°C. The annealing 
temperature in the first cycle was 65°C and was reduced by 0.7°C in each cycle for the next 
12 cycles and then kept at 56°C. The final reaction products were separated on a 
polyacrylamide gel and the gel was exposed to film, allowing visualization of the 
radiolabeled PCR bands. When this procedure was applied to DNA from 2 genotypes 
simultaneously, AFLP bands were identified that were diagnostic for one genotype or the 
other. Such bands are called informative AFLP bands, or AFLP markers. Table 2 shows 
Adapters used in the AFLP analysis. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/49822 



PCT/EP97/01218 



32 



enzyme adapter 

EcoRI 5'-CTCGTAGACTGCGTACC-3' 

3'-CATCTGACGCATGGTTAA-5' 

HindHI 5'-CTCGTAGACTGCGTACC-3' 

-V-CATCTGACGCATGGTCGA-5' 

PstI 5' -CTCGTAGACTGCGTACATGCA-3' 

3'-CATCTGACGCATGT-5' 
5-GACGATGAGTCCTGAG-3' 
3'-TACTCAGGACTCAT-5' 



Msel 



Generation of AFLP markers and fine mapping of the NIM1 locus. 
A populat.cn of recombinant inbred lines derived from a cross between the AraMopsis 
ecotypes Landsberg erecta (Ler) and Columbia (Col) (Lister and Dean. 1993, Plant J. 4, 
745-750) was used for AFLP marker screening. The primers used for the AFLP screening 



were: 



EcoRI-pnmers: 5 -GACTGCGTACCAATTCWN-3' 
Msel-pnmers: 5' GATGAGTCCTGAGTA AXWN-3' 

An "N" ,n the pr.mers indicates that this part was variable (A, C. G or T), a "W" 
-ndicates A or T. and an "X" indicates a C. All 8 P oss lb le pnmers were used for both the 
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EcoRI- and Msel-pnmer. This gave a total of 64 (8 x 8) primer combinations (PCs) that 
were used to amplify DNA from the recombinant inbred line and the parental genotypes, Ler 
and Coi, as described above. The amplification reactions were run on a denaturing 
polyacrylamide gel to separate AFLP fragments by size and the gel was exposed to film. 
The film was inspected for bands that were present in only one genotype, i.e. inspected for 
AFLP markers. 

The AFLP markers, i.e.. DNA fragments that are polymorphic between both parents 
of the recombinant inbred lines, were used for constructing a genetic map of the 
recombinant inbred line population. Exampie 1 .5i below describes the mapping of the NIM1 
gene on Arabidopsis chromosome 1, at approximately position 85. Those AFLP markers 
that had been mapped (using the recombinant inbred line) between positions 81 and 88 of 
Arabidopsis chromosome 1 were chosen for analyzing recombinant plants for the presence 
of said AFLP markers and thus for mapping the NIM1 gene more precisely. Seven AFLP 
markers from this region were identified as being informative; they were polymorphic 
between both parents of the n/m/xLer cross. Six AFLP markers were Ler-specrfic, i.e. 
these AFLP markers were absent in Ws (and in Col as well). One AFLP marker was Ws- 
specific, i.e. a Col-specific AFLP marker (absent in Ler) was also present in Ws. These 
AFLP markers are: L81.1, L81.2, W83.1. L84, L85. L87 and L88 (an L-marker is specific for 
ecotype Ler and a W-marker is specific for both the ecotypes Col and Ws; the number 
indicates the map position). These AFLP markers were used to analyze recombinant piants 
from tne n/m/xLer cross (see below). In addition, AFLP marker C86 (a recombinant, mbred- 
line-derived marker specific for Col) was used in isolating DNA clones (see below). Table 3 
lists the primer sequences that were used to obtain these AFLP markers 
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Table 3 shows primer comb.nat.ons of AFLP markers derived from recomb.nant 
mured line population. 

"EcoRI-" refers to the sequence 5'-GACTGCGTACCAATTC-3' and 
"Msel-" refers to the sequence 5'-GATGAGTCCTGAGTAA-3'. 

Table 3 



AFLP marker Corresponding primer combinations 



L81.1 


EcoRI-CA 


Msei-CCG 


L81.2 


EcoRf-AA 


Msel-CAA 


W83.1 


EcoRI-CA 


Msei-CTC 


L84 


EcoRI-AAT 


Msel-CAA 


L85 


EcoRI-CA 


Msel-CCT 


L87 


EcoRI-CA 


Msel-CTT 


L88 


EcoRI-AG 


Msel-CTA 


C86 


EcoRI-AG 


Msel-CCT 



A deta.led genetic map of the reg,on was constructed using the AFLP markers 
described above by typ.ng the recombinants. A tota, of 337 recombinant plants were 
ava.lab.e out of 1 1 44 F2 „/*,/ p.ants. These recombinants were first screened with the 
North-f.ank.ng AFLP markers L81.2 and ATHGENEA and the South-flanking markers L88 
and ngal 1 1 . Forty-eight plants were homozygous nimllniml and heterozygous at 
ATHGENEA and L81.2, and 21 piants were homozygous nimUnimi and heterozygous at 
ngal 1 1 and L88. These recomb.nant p.ants were further analyzed w.th 9 AFLP markers ,n 
the /V/M region, including 4 AFLP markers that were derived from the recombinant ,nbred 
•me mapping population, (W83.1, L84, L85 and L87) and 5 AFLP markers denved from 
analysis of YAC clones (W83.3/W84 1 . W84.2, W85.1 , W86.1 and L86. see below). 

The genetic map of NIMl. based on this anaiys.s. ,s dep.cted ,n Figure 4 As seen, 
27 recombinants were found between marker W84.2 and NIM1 and 1 4 recombinant 



ts were 
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found between W85.1 and NIM1. Marker L85 is linked closely to NIMh but this marker 
could not be mapped on the YAC, BAC or P1 clones (see below) and was, therefore, not 

useable for identification of the NIM1 gene. 

5. Physical mapping of the NIM1 region. 

i. Isolation of YAC clones using AFLP markers closely linked to NIM1. 

The CiC library, an Arabidopsis ecotype Columbia YAC library (Bouchez et a!, 1995, 
6th Int. Conf on Arabidopsis Research, Madison, Wl), was screened for YAC clones in the 
NIM region. This library has about 2.5 nuclear genome equivalents and has an average 
insert size of 450 kb. The YAC library was screened with two AFLP markers: W83.1 and 
C86. W83.1 is the most closely linked recombinant, inbred-line-derived AFLP marker north 
of NIM1, and C86 is a recombinant, inbred-line-derived AFLP marker specific for Col 
(absent in Ler and Ws). C86 mapped south of the NIM1 gene on the map of the 
recombinant inbred line population. This Col AFLP marker has been used instead of the 
closely linked Ler AFLP markers (Figure 4), because the latter AFLP markers detected only 
ecotype Landsberg erecta and hence cannot be used for screening the Columbia YAC 
library. 

The YAC library was screened in two steps. Firstly, the cells of the YAC clones of 
each plate of the twelve 96-well microtiter plates were pooled (a plate pool) and used for 
DNA isolation as described by Ross et at (1991 , Nucleic A cids Res. 19. 6053). The pools 
were screened with both AFLP markers. Subsequently, from each positive plate pool, the 
DNA samples of each row (a pool of 8 clones) and of each column (a pool of 12 clones) 
were screened with the AFLP marker for which the ptate pool was positive. In this way, the 
individual positive YAC clones could be identified. The screening yielded a total of 4 YAC 
clones: YAC 12F04 and YAC 12H07 were isolated using the North AFLP marker W83.1 , 
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and YAC 10G07 and YAC 7E03 using the south AFLP marker C86 (for the nomenclature of 
the YAC clones the C!C numbering is used). The YACs were -fingerprinted- by AFLP, g,v,ng 
YAC-specific AFLP fragments. Fingerprints of the YACs were compared and used to 
estimate overlaps between the YACs (see also Tab.es 5 and 6). Based on the AFLP 
fingerpnnts, clone 7E03 is essentially covered by clone 10G07 (see also table 5) and clone 
12H07 is likewise essentially covered by clone 12F04 (see also table 6). 

ii. Generation of AFLP markers from YAC clones. 

Since the AFLP markers described above were genetically relatively far from the 
N,M1 gene (see figure 3). additional AFLP markers were deve.oped ,n an effort to find 
markers that were closer to the NIM gene. 

A screening for additional YAC-denved AFLP markers was performed on DNA 
samples of the following: DNA of the isolated YAC clones (4 YACs were identified, as 
described above), the yeast stra.n without a YAC. and the three Arabidopsis ecotypes Co!. 
Ler and Ws. In this way. the fragments specific for the YAC clones (absent in the yeast 
strain and present in Col) could be tested for po.ymorphism in Ler and Ws (the parents of 
the recombinant plants identified in Example 1.5 below). All identified polymorphic 
fragments would thus be additional AFLP markers. In the first AFLP screening the enzyme 
combination (EC) EcoR./Msel was used. In this screening two YAC clones, 10G07 and 
7E03 (detected w,th AFLP marker C86. see below), the yeast strain without a YAC and the 
three Arabidops.s ecotypes Col, Ler and Ws were assayed. The primer combinations with 
the selective extensions used can be d.v.ded into three groups and are depicted in Table 4. 
A total of 256 (64 + 96 + 96) primer combinations were screened 

In Table 4 below the primer sequences used in the AFLP screening of two YAC 
Clones, 10G07 and 7E03, the yeast stra.n without a YAC. and the three Arabidops.s 
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ecotypes Col. Ler and Ws are shown. Three groups of primer combinations have been 
used. An "NT in the primers indicates that this part was variable (A, C, G or T), an "S" 

indicates CorG.a "W" indicates A or T, and a "Y" indicates C or T. 

Table 4 



EcoRI-primers- 

5' GACTGCGTACCAATTCGW-3' 

5 ' G ACTGCGT ACC A ATTCTS - 3 
Msel-pnmers: 

5'-GATGAGTCCTGAGTAAAAS-3' 
5 '-G ATG AGTCCTG AGT A A AS A-3' 
5'-GATGAGTCCTGAGTAAATN-3' 
5'-GATGAGTCCTGAGTAACAN-3' 
5'GATGAGTCCTGAGTAACTN 3' 



EcoRI-pnmers: 

5-GACTGCGTACCAATTCAN-3' 
5'-GACTGCGTACCAATTCCW-3' 
5'-GACTGCGTACCAATTCTW-3' 
Msel-pnmers: 

5'-GATGAGTCCTGAGTAAAAS-3' 
V - G A T G A G T C C T G A G T A A AS A-3 ' 
5-GATGAGTCCTGAGTAAGAY-3' 
5'-GATGAGTCCTGAGTAAGTW-3' 
5 ' -G ATG AGTCCTG A GT A ATC G - 3 ' 
5'-G ATG AGTCCTG ACTA ATCT- 3' 
5 ' G ATG AGTCCTG AGT A ATG W - 3 ' 
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EcoRI-primers: 

5'-GACTGCGTACCAATTCGW-3' 

5'-GACTGCGTACCAATTCTN-3' 
Msel-pnmers: 

5'-GATGAGTCCTGAGTAAGAW-3' 

5'-GATGAGTCCTGAGTAAGCW-3' 

5 -GATGAGTCCTGAGTAAGTW-3' 

5'-GATGAGTCCTGAGTAATAN-3' 

5-GATG AGTCCTGAGTAATCW-3' 

5-GATGAGTCCTGAGTAATGW-3' 

5'-GATGAGTCCTGAGTAATTS-3' 

In total, 83 Col-specific fragments were generated, of which 62 were shared by both 
YAC Cones. Three fragments were AFLP markers polymorphic between Ws and Ler, of 
wh,ch two were Ws AFLP markers (a Col fragment also present ,n Ws and absent in Ler) 
and one was a Ler AFLP marker (a Col fragment a.so present in Ler and absent ,n Ws). 
These results are presented in Table 5 below. 

Table 5 shows a number of shared and unique AFLP fragments detected in YACs 
10G07 and 7E03 and the number of informal AFLP markers among these fragments in 
Ws and Ler genotypes. 
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Table 5 



AFLP fragments in YAC clones AFLP marker 



10GO7 



7E03 



Ws 



Ler 



shared 



62 



62 



2 



unique 



21 



0 



0 



0 



This AFLP analysis thus yielded 3 new AFLP markers (see Figure 4 and below). 
Their positions relative to each other and relative to the recombinant, inbred-line-derived 
AFLP markers were determined by analysis of the recombinants with these AFLP markers. 

A second screening for AFLP markers was performed assaying a!! four identified 
YAC clones (see below) and using the enzyme combination Pstl/Msel. The primers used 
are: 

Pstl-primers: 

5-GACTGCGTACATGCAGAN-3' 
5-GACTGCGTACATGCAGCW-3' 
5'-GACTGCGTACATGCAGGW-3' 
5'-GACTGCGTACATGCAGTN-3' 

Msel-pnmers: 

5-GATGAGTCCTGAGTAAAN-3' 
5' GATGAGTCCTGAGTAACW-3' 
5'-GATGAGTCCTGAGTAAGW-3' 
5' -GATGAGTCCTGAGTAATN-3' 

An "N ' in the primers indicates that this pari was variable (A. C, G or T) and a rt W in 

the onmers indicates that this was A or T A total of 144 (12 x 12) primer combinations was 
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screened on al, four IS o.ated YAC clones. 12F04, 12H07. 10G07 and 7E03; ,ne yeast strain 
without a YAC; and the three Arab.dopsis ecctypes Co,, Ler and Ws. In total. 219 AFLP 
fragments were generated, of which 144 were present in YAC clones 12F04 and 12H07 (72 
were un, q ue for Cone 12 F04 and 72 were shared between both YACs) and o, which 75 
were present in YAC Cones 10G07 and 7E03 (33 were unique for clone 10G07 and 42 
were shared between the 2 YACs). Three fragments derived from the first set of YAC 
ciones were polymorphic (Ws AFLP markers). These resu.ts are presented in Tab.e 6 
below. 

Tab,e 6 Ns,s ,he number o, shared and un.cue AFLP fragments detec.ed in YACs 

and the number of informatlvp afi p r^o^^~ 

ormatrve AFLP markers among these fragments in Ws and Ler 

genotypes. 



Table 6 

number of AFLP fragments in YAC clones 
12F04 I2H07 I0G07 



AFLP markers 



7E03 



Ws Ler 



shared 


72 


72 


0 


0 


I 


0 


unique 


72 


0 


0 


0 


2 


0 


shared 


0 


0 


42 


42 


0 


0 


unique 


0 


0 


33 


0 


0 


0 



The results indicate tha, YAC Cone , 2 H07 ,s par, of the farger YAC Cone , 2 F04 

-be iarger YAC Cones, 1 2 F04 and ,0G07. do no, overiap These data do no, affow the 
pos,„on,ng o, the NIM1 gene on any „, these YAC Cones. The whoie screen.ng. ,nvo,v,ng 
400 pnmer comb,na„ons producng 302 AFLP fragmen.s ,n the NIM reg,on. yie ,ded 5 usefu, 
AFLP markers, of which 4 were W S -spec,„c and one Ler-spec,„c. These 5 add,„ona, AFLP 
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markers have been mapped by analysis of recombinant plants (see figure 4 and below) and 
are denominated W84.1 (a.k.a. W83.3). W84.2, W85.1 , W86.1 and L86. 



Table 7 lists the primer sequences used to obtain these AFLP markers. These 5 
additional AFLP markers raised the total number of AFLP markers to 12 in the region from 
L81 .1 to L88 (see figure 4 and below). 

Table 7 shows primer combinations of AFLP markers derived from YAC clones. 

"EcoRI-" refers to the sequence 5'-GACTGCGTACCAATTC-3\ 
"Msel-" refers to the sequence 5'-GATGAGTCCTGAGTAA-3' and 
"Pstl-" refers to the sequence 5'-GACTGCGTACATGCAG-3'. 

Table 7 

AFLP marker Pnmer combination with selective extensions 



W84.1 Pstl- AT Msel-TT 

W84.2 Pstl-AA Msel-TT 

W85.1 EcoRI-CT Msel-GTA 

W86.1 EcoRI-GT Msel-CTT 

L86 EcoRI-GT Msel-CTT 



This information was used to construct a physical map of the region, as snown in 
Figure 5. with approximate positions of the YAC clones, relative to the genetic map The 
map showed that the region containing the NIM1 locus, between markers W83.1 and 
W85.1, is partly covered by 3 YAC clones: 12F04 and 10G07/7E03. 
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... Construclion o, a P1/BAC conlig comain.ng the MM, gene. 

in the previous seCons i, was described now AFLP markers linked to ,he NIM1 
region were ,so,a,ed and now VACs corresponding t0 these markers were ,den,„ied and 
mapped. The results ob,ained wh„e locating ,he N,M, gene to a chromosome .ragmen, 
did no, anew ,he dehnihon o, a specie DNA segmen, containing the NIMJ gene: the 
Hanking AFLP markers were mapped ,o di,,eren, VACs ,ha, d,d no, overlap. „ was 
,here,ore. no, possibie ,o de,erm,n, ,ne precse physicai position of ,ne Amu, gene- ,, could 
be ,oca,ed on e„her ., ,he ,wo VACs or ,n ,he gap be,ween ,he YACs. An a„e,na„ve 
approach was selected ,o Cose ,he pbys.ca, gap be,ween ,he tlanking markers: a P, and 
SAC library were empioyed ,o bridge ,he gap between ,he ,lank,ng AFLP markers. 

The libranes used tor gap closure were an Arab.dopsis ecotype Columbia P, i.brary 
descr.bed by Liu e, a, (Th, Hm, j. 7. 35,-358. ,9 95 , and an ecotype Columb.a BAC l.brary 
described by Cho, e, a, (ht.p/genome-www.stantord.edu/Arab.dopsisWVo*, cho, h,m„ 
The P, ,ibrary cons,s,s o, abou, ,0.000 Cones wi,h an average insert s,ze o, 80 kb and ,he 
BAC library consis.s o, abou, 4000 clones „,,n an average insen s, 2 e 0, ,00 kb. In .heory 

these libraries represent about m nn^i^n 

P about 10 nuclear genome equivalents (assuming a haploid 

genome size for Arabidopsis of 120 Mb). 
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iv. Identification of P1 clones corresponding to the flanking markers. 

The flanking markers Ws84.2 and Ws85.1 were used to screen pools of P1 clones 
using a similar strategy as previously described for screening of the YAC library (see 
Example i.5i). Pi clones having the marker fragments were selected and "plasmid" DNA 
was isolated. The various P1 clone DNAs were fingerprinted using the ECs EcoRI/Msel and 
Hindlll/Msel and primers without selective nucleotides. A physical map was constructed, i.e. 
a map giving the size and overlaps of the clones, by comparing the AFLP fingerprints. The 
numoer of AFLP fragments that are unique and the number of AFLP fragments that are 
common between clones indicate the extent of the overlaps. The map is displayed in Figure 
6. The AFLP fingerprinting revealed that two sets of non-overlapping P1-contigs had been 
constructed each containing one of the flanking markers: P1-1 and P1-2 containing marker 
Ws84.2; P1-3 and P1-4 containing marker W85.1. Consequently, the gap between the 
flanking markers was not closed (Figure 6). 

The positions of the P1 contigs with respect to the YAC contig was determined by 
AFLP fingerprinting of the YACs and P1 clones with a number of YAC-specific PCs 
described above. P1 clones P1-1 and P1-2 appeared to overlap completely with YAC 
CIC12F04, but only partially with YAC CIC12H07. Therefore, the latter P1 clones could be 
positioned on the YAC contig CIC12H07/12F04 (Figure 6). P1 clones P1-3 and P1-4 
overlapped completely with both YACs CIC7E03 and CIC10G07 and it appeared that AFLP 
marker W86.1 , like W85.1 , was mapped to this P1 contig (Figure 6). 

Next, marker L85 was used to identify corresponding P1 and BAC clones. L85 is an 
ecotype-Landsberg-specific marker and, therefore, colony hybridization of radioactively 
labelled L85 DNA to PI and BAC filters was employed. Not a single P1 or BAC clone 
hybridizing to L85 was identified. This supported our earlier findings that the L85 sequence 
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;s lacK.ng „. ,r.e Arao^,r. s ecotype Columbia genome and is. therefore, the most likely 
explanation of why no corresponding clones were identified. 

v. Extending the NIM 7-flanking P1 Contigs. 

Various approaches were employed to extend from the flanking P1 contigs: 
YAC AFLP fragments specific to the South end of YAC CIC12F04 (unique to 
C.C12F04, not present in C.C 12H07) were used to identify P1 Cones by AFLP screening of 

pools of the library. 

1. YAC AFLP fragments from YAC 10G07 and overlapping with P1-4 were 
used to identify P1 clones by AFLP screening of pools of the P1 library. 

2. EcoRI restriction fragments from P1 done P1-6 (resulting from the AFLP- 
based P1 library screening of step t) were used as hybridization probes on 
filters of the BAG library. 

Vanous Pi and BAC Cones resu.ted from this screening and all were AFLP- 
fngerpnnted w.th the ECs EcoR./Mse. and Hind.H/Mse. us.ng pnmers w.thou, selective 
nucleotides. A new map was constructed as described above and is dep.cted in Figure 7. 
Table 8 shows the vanous AFLP PCs having AFLP fragments mapped to f,an k ,ng YACs 
and used to screen the Pi-library for corresponding P1 clones. 

Table 8 represents the vanous AFLP PCs used to screen the P1 .ibrary. The top half 
of the table shows PCs specific for the North YACs and the bottom half shows the PCs 
speafc for the South YACs. Also indicated are the YACs and Pt Cones where.n the AFLP 
fragments were detected. 
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Table 8 



AFLP PCs 

Pstl-AA MscI-TT 
Pstl-AT Msel-GT 
Pstl-CA Msel-AA 
Pstl-AC Msel-TG 
Pstl-AG Msel-TG 
Pstl-CT Msel-GT 



CIC YACs 

12F04 and 12H07 

l2F04-specific 

12F04-specific 

12F04-specific 

I2F04-specific 

12F04-specific 



PI -clones 

Pl-LPI-2 

Pl-1, Pl-2 

Pl-6 

Pl-7 

Pl-7 

Pl-7 



Comments 
Marker Ws84.2 



EcoRI-CT Msel-GT A 
EcoRI-GT Msel-CTT 
FcoRI-AA Msel-GT 
EcoRI-AT Msel-GA 
EcoRI-GG Msel-CT 



10G07 and 7E03 
10G07 and 7E03 
10G07 and 7E03 
10G07 and 7E03 
10G07 and 7E03 



PI-3, Pl-4 
Pl-4 

Pl-4, Pl-9 
Pl-4, Pl-9 
Pl-4, Pl-9 



Marker Ws85 .1 
Marker Ws86. 1 



P1/BAC contig of about 250 kb was obtained covering the South end of YAC CIC12F04 
(not extending from this YAC) and containing marker W84.2. A P1 contig of about 150 kb 
containing markers W85.1 and W86.1 was obtained; this contig is completely contained 
within YAC CIC7E03. 

Construction of a P1/BAC contig covering the NIM1 gene AFLP marker analysis on 
the recombinants with markers from the South end of the North P1/BAC contig (WL84.4 
and WL84.5, see below and table 11) showed that the previous "walking" steps were 
unsuccessful in the construction of a contig containing the NIM1 gene (see next section). 
Therefore, the existing North P1/BAC contig was extended South with the purpose of 
"walking" across the NIM1 gene, which would enable the definition and isolation of a 
specific DNA segment containing the NIM1 gene A hybriaizatton-oased approach was 
followed in which P1 or BAC clones located at the South end of the North P1/BAC contig 
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were used to -dentify Cones positioned Coser to NlM, (South oound). New cones resuitmg 
from the walking steps were mapped with respect to the existing contigs us.ng AFLP 
f-ngerpnntmg with rn e ECs EccR,/Msel and H,nd„,/Mse, as described above. A tota, o, 5 
subsequent walking steps appeared to be necessary to "cross" the N,M1 gene. Table 9 
shows the clones obtained in the various walking steps. 

Table 9 is an overview of the vanous wa.k.ng steps showing the hybridization probe 
used to screen tne Pi and BAG .ioranes and the se.ected cones hybndiz.ng to the probes 
and extending m the South direction. 



Table 9 



New clones extending Sr,.,rh 
-Step I P| .7 BAC-02 

Step 2: BAC-02 P1-16.BAC-03 

Step 3: BAC-03 P 1 - 1 7. PI - 1 8 

S,e P 4: P1 -'8 PI -21. PI -20, BAC-04 

Ste P 5: BAC-04 P ,. 22 . P ,. 2 3. Pl-24. BAC-06. BAC-05 

A physical map of the vanous clones resulting from th,s w alking effort is depicted ,n 
Figure 8. A tota, distance of about 600 kb was covered starting from the ,n,t,a. walking point 
-rker WB4., The South end of the contig presented ,n P, gure e appeared to contain the 
NIM1 gene (see next section). The cont.g extends more than 300 kb South from VAC 
CIC12F04 and appeared not to overlap w,th YACs C.C10G07 and CC7E03, indicating that 
the NIM1 gene „ ,n the gap between the f.ank.ng YAC cont,gs and that ,h,s gap ,s at ,east 
300 kb. 
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vi. Construction of an Integrated Genetic and Physical Map of the NIM1 Region. 

In the previous sections it was described how AFLP markers linked to the NIM1 
region were isolated, how YACs corresponding to the flanking markers were identified, and 
how a P1/BAC contig was constructed extending about 550 kb South from the closest North 
flanking AFLP marker W84.2. This section describes the generation of new AFLP markers 
from the P1/BAC contig, the physical mapping of these markers on this contig and the 
genetic mapping of these markers with the available recombinants. 

1. Generation of New AFLP Markers from the P1/BAC Contig 

As described in the previous section, the P1 and BAC clones of the contig extension 

were characterized by AFLP fingerprinting using the ECs EcoRI/Msel and Hindlll/Msel. This 

defined quite accurately the extent of the overlaps between the various P1 and BAC clones 

and, in addition, generated a number of AFLP fragments specific for these clones. AFLP 

primers without selective nucleotides are used in fingerprinting of purified plasmid DNA of 

the P1 or BAC clones. Selective nucleotides will be necessary, however, to be able to use 

these Pi or BAC-specific AFLP fragments for detection in Arabidcps.s. By determining the 

end sequences of the amplified restriction fragments, AFLP primers having the appropriate 

selective bases can be designed to amplify the P1- or BAC-specific AFLP fragment in 

Arabidopsis. All AFLP fragments originate from the ecotype Columbia (Coi) and. therefore, 

it also should be determined if the Columbia AFLP markers are informative in the NIM1 

recombinants which are derived from a cross of the ecotypes Landsberg erecta (Ler) and a 

niml mutant of the ecotype Wassilewskija (Ws-nim) In principle, there are 4 types of AFLP 

fragments, two of which are useful markers as indicated in Table 10 below: 

Table 10 is an overview of the types of AFLP markers found ( + ) or (-) indicates 
presence or absence of the AFLP fragment 
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Table 10 



Col Ler Ws-mm marker-type 

not informative 
+ " Ler marker 

* Ws marker 

* * " not informative 



m genera,. „n 9 er P „„„„ g „, the P , ana BAC Cones genera.ed 30 ,o 40 EooWMse, 
' a9men,S " d 60 *° 80 H>"««>"M»I AFLP fragments for each .ndividua, Cone The 
end sequences o, ,nd,vidua, ,ragmen,s were de.erm.ned by s.andard seguencno 
~es. N.X.. spec,,, AFLP pnmers se,s w„h se,ec„ye ex.ens.ons o, 3 nM LZZ 
bo,h ,he FooR, or H,„d„, pnmer and ,he Mse, pr.me, were ,es,ed on ,he ,o„ow,ng pane, o, 

1 P1/BAC clone from which the AFLP marker was denved 
2a. Yeast 

2b. YAC clone C.C12F04 (only for AFLP fragments from P1-7) 2c Y A C Cone 
CIC10G07 COne 

3a. Coi, origin of the Pi and BAC libraries 
3b. Ler, parent 1 of the mm recombinants 
3c. Ws-nim. parent 2 of the nim recombinants 

h,«jit*z r ,e se,ec,ed ,or sequence ana,yses - ,he - E " R " M - •»« 

H,nd„„Mse, AFLP fragmems: BAC01/P1-7. P,,7,P,-,a. BAC-04/BAC-06 The AFLP 

' r0m C ' 0re PW We,e a " de ' eC,ed YAC C ' C,2FM - - done ,s 

comp,e,e, v comalned wi,h,„ ,h,s VAC. None o, ,he P„BAC-spec„,c AFLP ( ragmen,s was 

: ; "■ YA h C d0 " e Cm ~« - P,,BAC con„g Opes no- o„, 9 .L 
gap be.ween ,he ,wo „anK,n g VAC con„ gs . AFLP marKers se.eced ,o, ana ly s,s o, ,he n,m 

recombinants are depicted in Table 1 1 

Table 11 is an overview of the sefprfoH api d ( 

OT Tne s e»ected AFLP markers from the AFLP PCs specific 

-or, he .annus P, and BAC Cones A -WL" marker ,s a marker o rlg ,na„„ 9 ,om , he same 
PC and d,sp,ay,n 9 ,wo AFLP marKers. a Ws and a Ler marker, wh,ch appeared ,o be 
comp,e,e,y iinked in repu.s.on phase upon anaiys.s ol ,he MM recombinants. 
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Table 1 1 



Origin 


Marker name 


AFLP pnmers combination 


PI-7 


WL84.4 


EcoRI-AGC 


Msel-ACT 


Pl-7 


WL84.5 


Hindlll-CTC 


Msel-TTC 


P1-17/P1-18 


Ler84.6a 


Hindlll-CGT 


Msel-ATT 


P 1 - 1 IIP I - 1 8 


Ler84.6b 


Hindni-ATT 


Msel-CAT 


pi - is 


Ler84.6c 


HindNI-TCT 


Msel-TAT 


P 1 - 1 8 


Ler84.7 


EcoRI-AAA 


Msel-AGA 


BAC-04/06 


Ler84 8 


EcoRI TTC 


Msel-AGT 


BAC-06 


Ler84.9a 


EcoRI-AAA 


Msel-TGT 


BAC-06 


Ler84.9b 


EcoRI-ATC 


Msel-TCC 


BAC-06 


Ler84.9c 


EcoRI-ATG 


Msel-GTA 



2. Physical Mapping of the New AFLP Markers. 

The AFLP markers described above were physically mapped by detecting their 
presence in the various P1 and BAC clones. The results are presented in Figures 9-1 1 . 

3. Genetic Mapping of the New AFLP Markers. 

The AFLP markers were all analyzed on a selected set of the recombinants. The 
results obtained are summarized in Tables 12a, 12b and 12c. 
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The AFLP markers Ler84.8, Ler84.9a, Ler84.9b ana Ler84.9c appeared to map at 
the South side of NIM1. Recombinants were found that were phenotypically ntml 
(homozygous, genotype Ws-n/mr/Ws-n/m/) and heterozygous for these AFLP markers (the 
Ler-specific AFLP marker was detected, genotype is Ws-n/m7/Ler). AFLP marker Ler84.8 
appeared to be closest to N/M1: only a single recombinant (CMOS) was scored as 
heterozygous Ws-n/m7/Ler and homozygous \Ns-nfm1/W$-nim1. AFLP markers Ler84.7 
and Ler84.6c appeared to completely cosegregate with NIM1: all recombinants had an 
identical NIM1 and AFLP marker genotype. North of NIM1, marker L84.6b appeared to be 
closest to N/M1: three ntml phenotype recombinant plants, C-074, D-169 and E-103 (Table 
12c), were found to be heterozygous Ws-n/m7/Ler at this marker. With the aid of the 
cosmid contig generated from P1-18, BAC-04 and BAC-06, AFLP markers Ler84.6b and 
Ler84.8 were mapped in P1-18 and BAC-04, respectively, and found to have a physical 
distance of approximately 1 10 kb. This defines nim1 to be located on a DNA segment 
estimated to be 1 10 kb in length. From this analysis it has been determined that the NIM1 
gene is contained in clone BAC-04 or P1-18. Clones BAC-04 and P1-18 have been 
deposited with ATCC and given deposit numbers ATCC 97543 and ATCC 97606. 
respectively. 

vii. Genetic and Physical Fine Mapping of the NIM1 Gene. 

The previous section described how a DNA segment containing the NIMgene was 
delineated by physical mapping of the flanking AFLP markers (Ler84.6b and Ler84.8) on 
the P1/BAC contig. The flanking markers appeared to map on two overlapping clones, Pl- 
18 and BAC-04. This section describes how additional BAC-04-specific and P1 -18-specific 
AFLP markers were generated to increase the resolution of the genetic and physical map in 
tne region containing the NIM1 gene. 
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viii. Generate of Additional AFLP Markers from the Cosmid Array. 

Four ECs were se,ec,ed to generate additions. AFLP markers for fine mapping of 
N,MV. Pstl/Msel, Xba,/Mse.. BstYI/Msel and Taql/Msel. Pst,/Msel and Xbal/Mse. AFLP 
fragments were generated on clone P1-18 and BAC-04 and the se.ect.ve sequences 
necessary for detection ,n Arabidops.s were determined. Similarly, the AFLP fragments and 
selective sequences were determined for BstY./Msel and T aq ,/M Se) ; however, ,n th,s case 
the procedure was performed us.ng cosm,d DNAs: A11, C7. E1 and E8 for BstY,/Mse» 
(com Dle te NIM1 reg ,on) and D7, E8 and E6 for Taq./Mse, (South s,d e of NIMl reg ,on) 
informative AFLP markers selected for further genet.c and physical mapping are shown ,n 
Table 13. Additional adapters used m this work are shown in Table 14 

n,u, T B 13 Sh ° WS me AFLP mafkerS US6d f ° r 9en6t,C 3nd P h **«' fne mapp.ng of 

aTaTCT ." T) " ,nd,C3teS th3t " reStriCt ' 0n S ' ,e — ^ — - r 
AGATCT orGGATCT. 



Table 13 



Marker 


EC/PC 




Ler84.Y! 


BstYI(T)-GCT 


Msel-AAC 


Ws84.Y2 


BstYKT)-TCT 


Msel-GCA 


Ler84,Y3 


BsrYI(T)-AAG 


Msel TAT 


Ler84.Y4 


BstYI(T)-GTT 


Msel-AGA 


Ws84.Tl 


Taql-TAC 


Msel-GGA 


Ler84.T2 


Taql-TTG 


Msel-GGA 
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Table 14 shows same Additional adapters used for identifying new AFLP markers. 

Table 14 

BstYI: 5'-CTCGTAGACTGCGTACC-3' 

3'-CATCTGACGCATGGCTAG-5' 
TaqI: 5-CTCGTAGACTGCGTACC-3' 

3'-CATCTGACGCATGGGC-5' 



ix. Physical mapping of new AFLP markers to the cosmid contig. 
The markers indicated above were physically mapped on the cosmid array by 
determining their presence in the various cosmid clones (Figure 11). 

1 . Genetic Mapping of New AFLP Markers. 

The new AFLP markers were genetically mapped by AFLP analysis of the closest 
North and South recombinants. The closest North (recombinant D169) and South 
(recombinant C105) recombination points were mapped (see Table 15). The AFLP analysis 
showed that recombinant D169 had a recombination South of marker L84.Y1. but North of 
marker W84.Y2. The recombination point in recombinant C105 mapped between markers 
L84.T2 and L84.8. Using the available set of recombinants this allowed further delineation 
of the chromosomal interval containing NIM1. the distance oetween the flanking 
recombination points appeared to be 60-90 kb (Figure 12). 
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2. Construction of a Cosmid Contig. 

For complementation of the ntml plant phenotype. transformation of nim1 plants is 
required with a wiidtype NIM1 gene. This can be accomplished by transforming these 
plants with a cosmid containing the gene. For this purpose, a cosmid contig of the NIM1 
region is constructed. Since Arabidopsis is transformed using Agrobacterium, the cosmid 
vector used is a binary vector. 

DNA was isolated from BAC-04, BAC-06 and P1-18, and used to make a partial 
digest using restriction enzyme Sau3AL The 20-25 kb fragments were isolated using a 
sucrose gradient, pooled, and filled in with dATP and dGTP. The binary vector (04541 ) 
was cleaved with Xhoi and filled in with dCTP and dTTP. The fragments were next ligated 
into the vector. The ligation mix was packaged and transduced into E.coli. 

This cosmid library was screened with the BAC-04, BAC-06 and P1-18 clones and 
positive clones isolated. These cosmids were next AFLP fingerprinted and arranged into a 
contig of overlapping clones spanning the NIM1 region. The insert sizes of the cosmids 
were determined, and limited restriction mapping was performed. The results are shown in 
Figure 10. 
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Example 2 

Identification of a Clone Containing the NIM1 gene 
1 . Complementation Via Stable Transformation 

Cosm.ds that are generated from clones that span the NIM1 reg.on (descnbed 
above) are moved into Agrobactenum by triparental mating These cosmids are then used 
to transform n,m1 Arabrtops.s by vacuum infiltration (Mindnnos et al.. 1 994, Cel] 78, 1089- 
1099) or by standard root transformation. Seed from these plants ,s harvested and al.owed 
to germnate on agar plates with kanamycin (or another appropr.ate antibiotic, as selection 
agent Only p,antlets that are transformed w„h cosm.d DNA can detoxify the selects agen, 
and survive. Seedlings that survive the selection are transferred ,0 soil and tested for the 
n,m phenotype or their progeny are tested for the nim phenotype. Transformed o,ants that 
no longer have the n,m phenotype identify cosmos) that contain a functional NIM1 gene. 

2 Complementation in a Transient Expression System 

The ability of DNA clones to complement the n, m i mutation ,s tested in 2 transient 
expression systems. 

in the f.rst system, n,m1 Arabidops.s plants containing a PR1-Iuc.ferase (PRL.ux) 
transgene are used as bombardment recipient material. These p.ants are generated by 
transforming Columbia ecotype plants with a PR1- Iux construct by vacuum .nfi.trat.on. 
followed by kanamycin selection of the harvested seed, as descnbed above Transformed 
Plants that express luciferase activity after induction w„h INA are selfed and homozygous 
P.ants are generated. These are crossed to nim1 p.ants. in the transient assay, progeny 
Plants from th,s cross that are homozygous for n,ml and for PR,., UX are used for 
.dentif.cation of DNA clones that can complement the niml phenotype. To th,s end, the 
Plants are first treated w,th INA, as descnbed in example 1.1 above. Two days later these 
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plants are harvested, surface sterilized and plated on GM agar medium. The leaf tissue is 
then bombarded with cosmid, P1 or BAC clones (or subclones) from the NIM1 region and 
after one day, the iucjferase activity of the leaves is measured. Clones that induce 
luciferase activity contain the NIM1 gene. 

In a second system, niml plants are treated with INA (as described in example 1.1 
above) and 2 days later bombarded with cloned DNA (cosmid. P1. BAC and/or YAC clones 
or subclones) from the NIM1 locus region and a reporter piasmid. The reporter piasmid 
contains the luciferase gene, driven by the Arabidopsis PR1 promoter (PR1-Iux). In niml 
plants, INA does not activate the PR1 promoter (as described in example 1.2 above) and 
thus can not induce luciferase activity from the reporter piasmid. However, when a 
cotransformed DNA clone contains the complementing NIM1 gene, INA does induce the 
PR1 promoter, as evidenced by an induction of luciferase activity. One day after the 
cobombardment, the luciferase activity of the whole plant is measured. DNA clones 
(cosmids, P1 or BAC clones or subclones) that induce luciferase activity that is significantly 
above background levels contain the NIM1 gene. 

3. Changes in transcripts m ntrni phenotype lines. 

Since niml phenotype plants have mutations in the NIM1 gene, it is conceivable that 
in some lines the gene is altered in such a manner that there is no mRNA transcribed, or an 
aberrant mRNA (size) is produced. To test for this, RNA blot analysis is performed on the 
niml lines. 

RNA is isolated from Ws and Ler plants of these lines, (after water or INA or BTH 
treatment) and used to prepare northern blots. These blots are hybridized with DNA 
fragments isolated from clones of the DNA contig of the NIM1 locus DNA fragments that 
identify niml lines with aberrant RNA expression (aberrant in size or concentration), likely 
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identify (par: of) the NIM1 gene. The DNA fragment and surrounding DNA is sequenced 
a-.« used ,o isolate a cDNA (by library screen or by reverse transcription-PCR), which is 
also sequenced. The clone from which the fragment was .solated or the isolated cDNA is 
used to show complementation of the n,ml phenotype in stable and transient expression 



systems. 



Example 3 

Determination of the DNA sequence of the NIM1 gene. 
1 . Genomic sequencing. 

Genom.c clones that may contain the NIM1 gene are sequenced us,ng methods 
known in the art. These include BAC-04, P1-1 8 and the cosm.ds from the NIM1 reg.on. For 
-stance, the cosm.ds are digested with restriction enzymes and fragments that are derived 
from the insert are Coned into a genera, purpose vector, such as P UC18 or Bluescr.pt. The 
larger P1 and BAG clones are randomly sheared and fragments cloned into a genera, 
purpose vector. The fragments ,n these vectors are sequenced by conventiona, methods 
(e.g. by -primer walking" or generation of de.etions of inserts). The obtained sequences are 
assembled tnto a contiguous sequence. 

The sequence of the insert of a complementing clone contams the NIM1 gene. The 

approximate start and end of the N,Ml gene are deduced based on the DNA sequence. 

sequence motifs such as TATA boxes, the open reading frames present ,n the sequence. 

codon usage, the cosm.d complementation data, the relative ,ocat,on of the AFLP markers 

and additional relevant data that is gathered (see Example 4. below). 
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2. cDNA sequencing. 

The cosmid(s) or larger clones that contain the NIM1 gene (as described in Example 
2 above), are used to isolate cDNAs. This is accomplished by using the clones (or DNA 
fragments) as probes in a screen of a cDNA library of wiidtype Arabidopsis plants. The 
cDNAs that are isolated are sequenced as described for cosmid sequencing and used in 
complementation tests. To this end, full length cDNAs are cloned into a suitable plant 
expression vector, behind a constitutive promoter. These constructs are used in the 
transient assays as described above. Alternatively, the cDNAs are cloned into a binary 
expression vector, allowing for expression in plant tissues and for Agrobactenum- mediated 
plant transformation, as described in Example 2 above. A cDNA that contains the NIM1 
gene (as determined by complementation, isolation with a closely linked AFLP marker, 
isolation with a cosmid fragment, or by other deduction) is sequenced. 

The genes from Ws-O and nim1 plants are isolated and sequenced. The genes are 
obtained from a cosmid of cDNA library, using a fragment of the isolated NtM1 gene as a 
probe. Alternatively, the genes or cDNAs are isolated by PCR, using A///W7-gene-specific 
primers and genomic DNA or cDNA as template. Likewise, the nim1 alleles from other nim1 
lines (see Example 1.1 above) are isolated and sequenced in a similar manner. 

Example 4 

Description of the NIM1 gene and deduced protein sequence 
The DNA sequence of the NIM1 gene or cDNA is determined as described in 
Example 3 above. This sequence is analyzed with the use of DNA analysis programs, such 
as can be found in the Genetics Computer Group (GCG) package, in the Sequencer or 
Staden packages, or any similar DNA analysis program package. 
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Spec,fica.,y, u,e start and end of the gene are ae.erm.nec, based on open read.ng 
frame analyse, the presence of stop and potential start codons. the presence of potent.a, 
promoter motifs (such as the TATA box), the presence of P olyaden yl at,on s.gnais and the 
like. Also, the predicted am.no acd is deduced from the open reading frame. Both the DNA 
and prote.n sequence are used ,o search databases for sequences with homolog.es. such 
as transcr.pt.on factors, enzymes or motifs of such genes or prote.ns. 



Example 5 
Isolation of N/M1 homoiogs 
The A rabld o P s,s NIM1 gene may be usee as a p,obe in ,he low stringency 
hybr,d,za„on screening o. a genomic or cDNA library ,n order ,o ,so,a«e N,M1 homoiogs 
from oiher pian, speaes. Aiternatively. this is accomplished by PCR amplication, using 
primers designed based on ,he AnbMopsis N1M1 gene sequence and us,ng genom.c DNA 
or cDNA as template. The NIM 1 gene may be isolated „om corn. whea,. rice, bariey.rape 
seed, sugarbeet. potato, tomato, bean, cucumber, grape, tobacco and other cops o, 
interest and sequenced. With a so, of sequences from N IM , gene homoiogs ,„ hand, nev, 
primers can be designed from conserved portions of the gene, in order to isoiate N, M , 
homoiogs trom more distantly related plant species by PCR amplification. 

Example 6 

Complementation of the nimi-1 gene with genomic fragments. 
1 . Construction of a cosmid contig. 

A cosm.d con.g of the N,M1 reg.on was constructed us.ng CsCl-purified DNA from 

BAC04, BAC06 and P1-18. The DNAs of th P thr aa 

e unas of the three clones were mixed in equimolar 

quantities and were part.ally d.gested w.th the restnction enzyme Sau3A The 20-25 kb 
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fragments were isolated using a sucrose gradient, pooled and filled in with dATP and dGTP. 
Plasmid pCLD04541 was used as T-DNA cosmid vector. This plasmid contains a broad 
host range pRK290-based repticon, a tetracycline resistance gene for bacterial selection 
and the nptll gene for plant selection. The vector was cleaved with Xhol and filled in with 
dCTP and dTTP. The prepared fragments were then ligated into the vector . The ligation 
mix was packaged and transduced into E. coli strain XL1-blue MR (Stratagene). Resulting 
transformants were screened by hybridization with the BAC04, BAC06 and P1-18 clones 
and positive clones isolated. Cosmid DNA was isolated from these clones and template 
DNA was prepared using the ECs EcoRI/Msel and Hindlll/Msel. The resulting AFLP 
fingerprint patterns were analyzed to determine the order of the cosmid clones. A set of 15 
semi-overlapping cosmids was selected spanning the nim region (Figure 13). The cosmid 
DNAs were also restricted with EcoRI, Pstl, BssHII and SgrAI. This allowed for the 
estimation of the cosmid insert sizes and the verification of the overlaps between the 
various cosmids as determined by AFLP fingerprinting. 

2. Identification of a clone containing the NIM gene 

Cosmids generated from ciones spanning the NIM1 region were moved into 
Agrobacterium tumefaciens AGL-1 through conjugative transfer in a tri-parental mating with 
helper strain HB101 (pRK2013). These cosmids were then used to transform a kanamycin- 
sensitive nim 1-1 AraoiOoosis line using vacuum infiltration (Mindrinos et ai. ( 1994, Cell 78, 
1089-1099) Seed from the infiltrated plants was harvested and allowed to germinate on 
GM agar plates containing 50 mg/ml kanamycin as a selection agent. Only plantlets that 
are transformed with cosmid DNA can detoxify the selection agent and survive. Seedlings 
that survive the selection were transferred to so:! approximately two weeks after plating and 
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tested for the nimi phenotype as described below. Transformed plants that no longer have 
the nimi phenotype identify cosmid(s) that contain a functional NIM1 gene. 

3. Testing for the nimi phenotype of transformants. 

Plants transferred to soil were grown in a phytotron for approximately one week after 
transfer. 300 M m INA was applied as a fine mist to completely cover the plants using a 
chromister. After two days, leaves were harvested for RNA extraction and PR-1 expression 
analysis. The plants were then sprayed w.th Peronospora parasitica (isolate EmWa) and 
grown under high humidity conditions in a growing chamber with 1 9°C day/17° night 
temperatures and 8h light/16h dark cycles. Eight to ten days following fungal infection, 
plants were evaluated and scored positive or negative for fungal growth. Ws and nimi 
plants were treated in the same way to serve as controls for each experiment. 

Total RNA was extracted from the collected tissue using a LiCI/phenoi extraction 
buffer (Verwoerd, et al. NAR 17:2362). RNA samples were run on a formaldehyde agarose 
gel and blotted to GeneScreen Plus (DuPont) membranes. Blots were hybridized with a 
32 P-labeled PR-1 cDNA probe. The resulting blots were exposed to film to determine wh.ch 
transformants were able to induce PR-1 expression after INA treatment. The results are 
summarized in Table 16. 
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Taole 16 shows complementation of the nim1 phenotype by cosmid clones. 



Table 


16 




, Clone Name 

i 


j # of transtormams 


# of plants with IN A induced 
PR- 1 /total # of plants tested (%) 


j A8 


3 


0/3 (0%) 


All 


8 


4/18 (22%) 


! C2 

i , 


10 


1/10(10%) 


j C7 


33 


1/32 (3%) 
I | 


D2 


81 


4/49 (8%) 


D5 


6 


5/6 (83%) 


EI 


10 


10/10(100%) | 


D7 


129 


36/36(100%) | 


E8 


9 


0/9 (0%) 


F12 


6 


0/6 (0%) 


E6 j, | 

! 


0/1 (0%) ! 
1 


, — . i 

F / i 

i 

i . j 


34 


j 


WS-control (wild-type) 


NA 


28/28 (100%) | 

i 


niml-l phcnoiypc control 

1 


NA 

1 


0/34 (0%) j 

i 



N'A-not applicable 



Example 7 

Sequencing of the 9.9 Kb NIM 1 gene region. 
BAC04 DNA (25 ug, obtained from KeyGene) was the source of DNA used for 
sequence analysis This BAG was shown to be tre cione complete 11 / encompassing the 
region that complemented the mm) mutants DNA was randomly sheared using an 
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approach from Cold Spnng Harbor. Briefly. BAC DNA was shearea in a nebul.zer to an 
average molecular we.ght of about 2 kb. Ends of the sheared fragments we-e repaired 
using a two-step protocol with dNTPS, T4 DNA polymerase and Klenow fragment 
(Boehnnger). The end-repa.red DNA was run on a 1% | OW -melt agarose gel and the region 
between 1.3 kb and 2.0 kb was cut from the gel. DNA was iso.ated from the gel fragment 
by a freeze-thaw approach. DNA was then m,xed with EcoRV-digested pBRKanF4 and 
was ..gated overn.ght at 4 3 C. pBRKanF4 ,s a derivative of pBRKanFl. wh,ch was obtained 
from Kolav, Bhat at Vanderb.lt Un,vers,ty (Bhat. K.S., Gene 134(1). 83-87 (1993)). E. col, 
stra,n DH5a was transformed with the iigat.cn mix. and the transformation m,x was plated 
onto plates containing kanamycin and x-gal. 1600 white or light blue KanR colon.es were 
selected for plasm.d isolation. Individual colonies were p.cked into 96-well deep well plates 
(Po.yfiltron.cs. #U508) conta,n,ng 1 .5 ml of TB + Kan (50 ug/m.). Plates were covered and 
were placed on a rotating platform shaker at 37°C for 16 hrs. P.asm.d DNA was isolated 
us,ng the Wizard Plus 9600 M.niprep system fPromega, # A7000) according to 
manufacturer's recommendations. 

Plasm.ds were sequenced us.ng Dye Term.nator chemistry (Applied BioSystems 
PRISM Dye Term.nator Cycle Sequencing Ready Reaction K.t. P/N 402078) and primers 
designed to sequence both strands of the plasmids. Data was collected on ABI 377 DNA 
sequencers. Approximately 75% of these reactions yielded useful sequence information 
Sequences were edited and were assembled ,nto cont,gs using Sequencer 3.0 fGene 
Codes Corporation). Staden gap4 (Roger Staden, e-mail address rs@mrc-lmb.cam.ac.uk). 
and PHRED (PhH Green, e-ma.l address phg@ u wash.ngton.edu). The largest cont,g 
(approximately 76 kb) covered the complementing reg.on tc an average depth of 7 
independent calls/base. 



SUBSTITUTE SHEET (RULE 26) 



W O 97/49822 



PCI7EP97/012I8 



- 67 - 

A region of approximately 9.9 kb defined by the overlap of cosmids E1 and D7 was 
identified by complementation analysis to contain the nim1 region. Primers which flanked 
the insertion site of the vector and specific to the cosmid backbone were designed using 
Oiigo 5.0 Primer Analysis Software (National Biosciences, inc.). DNA was isolated from 
cosmids D7 and E1 using a modification of the ammonium acetate method (Traynor, P.L., 
1990. BioTechnidues 9(6): 676.) This DNA was directly sequenced using Dye Terminator 
chemistry above. The sequence obtained allowed determination of the endpoints of the 
complementing region. 

A truncated version of the BamHI-EcoRV fragment was also constructed, resulting in 
a construct which contains none of the "Gene 3" region (Fig. 13). The following approach 
was necessary due the presence of Hindlll sites in the Bam-Spe region of the DNA. The 
BamHI-EcoRV construct was completely digested with Spel, then was split into two 
separate reactions for double digestion. One aliquot was digested with BamHI, the other 
Hindlll. A BamHI-Spel fragment of 2816 bp and a Hindlll-Spel fragment of 1588 bp were 
isolated from agarose gels (QiaQuick Gel extraction kit) and were ligated to BamHI-Hindlll- 
digested pSGCGOL DH5a was transformed with the ligation mix. Resulting colonies were 
screened for the correct insert by digestion with Hindlll following preparation of DNA using 
Wizard Magic MiniPreps (Promega). A clone containing the correct construct was 
eiectroporated into Agrobactenum strain GV3101 for transformation of Arabidopsis plants. 
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Example 8 

Identification of the NIM1 gene region by allele sequencing. 



Table 17. Genetic segregation of non-inducib!e 



immunity mutants. 



Phenotypc 

Mutant Generation Female MaleWildtype mml 



mml - 1 ' 



Fl 
F2 



wildtype b mml -I 24 o 
98 32 



mml -2 Fi 



niml-2 Wildtype 3 



mm I 3 F i 



nimI-3 Wildtype 3 



nimi ' 4 Fl mml-4 Wildtype 3 0 

nim/ - 5 F1 nimi-5 Wildtype 0 35 



niml ' f > FI niml-6 Wildtype 3 



mm I 2 b 1 



mm! -2 mml -I {) 



15 



mm/ -3 F i 



mml -3 mm! -I () 



10 



mml -4 Fl 



mml -4 mml - 1 () 5 5 
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niml-5 Fl niml-5 ruml-i 0 14 

F2 9 85 

mmI-6 Fl mm/ -6 mm/-/ 0 (2 

4 Data from Delaney et al. (1995) PNAS 92,6602-6606. 
b Wild type denotes the wildtype Ws-0 strain. 



1. Genetic Analyses 

To determine dominance of the various mutants that displayed the nim1 phenotype, 
pollen from wildtype plants was transferred to the stigmata of nim1-1, -2, -3, -4, -5, -6. If the 
mutation is dominant, then the ntml phenotype will be observed in the resulting F1 plants. 
If the mutation is recessive, then the resulting F1 plants will exhibit a wildtype phenotype. 

The data presented in Table 17 show that wnen n/m 7-7, -2, -3, -4 and -6 are crossed 
with the wildtype, the resulting F1 exhibit the wildtype phenotype. Thus, these mutations 
are recessive. In contrast, the mml-5 X wildtype F1 progeny all exhibit the nim1 phenotype, 
indicating that this is a dominant mutation. Following INA treatment, no P parasitica 
spoliation was observed on wildtype plants, while the F1 plants supported growth and 
some sporulation of P parasitica. However, the ntml phenotype in these F1 plants was 
less severe then observed when ntm 1-5 was homozygous. 

To determine alfelism, pollen from the kanamycin-resistant niml-1 mutant plants was 
transferred to the stigmata of ntmi-2, -3, -4, -5. -6. Seeds resulting from the cross were 
plated onto Murashige-Skoog B5 plates containing kanamycin at 25 ug/ml to verify the 
hybrid origin of the seed. Kanamycin resistant (F1) plants were transferred to soil and 
assayed for the mm 1 phenotype Because the Fl progeny of the cross of the ntm 1 -5 
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mutant with the Ws w.ldtype delayed a niml phenotype, analys.s of mm1-5 X nimui F 2 
was also earned out. 

As shown in Table 1 7. all of the resulting F1 plants exh.bited the n,m1-l phenotype. 
Thus, the mutation .n the mml-2, -3, -4. -5, -6 was not comp.emented by the mml-T. these 
plants all fall within the same complementation group and are therefore allelic. Analysis of 
F2 progeny from the mm 1-5 X mm 1- 1 cross also displayed the mml phenotype. confirming 
that nim 1 -5 is a mm 1 allele 



2. Sequence Analysis and Subcloning of the NIM1 Region 

The 9.9 kb reg.on conta.nmg the NfMl region was analyzed for the presence of 
open reading frames ,n all six frames using Sequencher 3.0 and the GCG package. Four 
regions containing large ORF's were identified as possible genes (Gene regions 1-4). 
These four reg.ons were PGR amplified from DNA of the wHd-type parent and six different 
a,ml allelic vanants. Primers for these amplifications were selected us-ng OHgo 5.0 
(National Biosciences. Inc.) and were synthes.zed by Integrated DNA Technologies, Inc. 
PCR products were separated on 1 .0% agarose gels and were purified using the QIAqu.ck 
Gel Extraction Kit. The purified genomic PCR products were directly sequenced using the 
primers used for the in.tial amplification and with additional primers designed to sequence 
across any reg.ons not covered by the initial primers. Average coverage for these gene 
regions was approximately 3.5 reads/base. 

Sequences were edited and were assembled using Sequencher 3.0. Base changes 
specific to various n,m1 alleles were identified only in the reg.on designated Gene Region 2. 

The P osit,ons l.sted ,n Table 18 relate to Figure 14 and relate to the top strand of the 
9.9 kb reg.on featured in Figure 13. The open reading f rames from the gene regions 
descnoed in Figure 1 3 as 1 , 2. 3 and 4 were sequenced and the changes in the d.f ferent 
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rami alleles are shown in the Table. The changes that are described are on the top strand, 
5c to 3c, as it would relate to Figure 13. 

It is apparent that the NIM1 gene was cloned and that it lies within Gene Region 2, 
since there are amino acid changes or alterations of sequence within the open reading 
frame of Gene Region 2 in all 6 nim1 alleles. At the same time, at least one of the nim1 
alleles shows no changes in the open reading frames within Gene Regions 1 , 3 and 4. 
Therefore, the only gene within the 9.9 kb region that could be N/M 1 is the Gene Region 2, 
the N!M1 gene. 

The Ws section of Table 18 indicates the changes in the Ws ecotype of Arabidopsis 
relative to the Columbia ecotype of Arabidopsis. Figures 13, 14, 15 and all others wherein 
sequence is shown relate to the Columbia ecotype of Arabidopsis, which contains the wild 
type gene in the experiments that were conducted. The changes are listed as amino acid 
changes within the gene 2 or NIM1 region and are listed as changes in base pairs in the 
other regions. 

Figure 13 shows 4 different panels that describe the cloning of the NIM1 gene and 
describe the entire 9.9 kb region. Figure 14 is the sequence of the entire 9.9 kb region in 
the same orientation as described in Figure 1 3. Figure 1 5 is the sequence o? the specific 
NIM1 gene region which is gene region 2 indicated in Figure 13; the sequence of Figure 15 
contains the NIM1 gene. Figure 15 shows the ammo acid sequence in singse letter code 
and shows the full length cDNA and RACE product that was obtained in capital letters m the 
DNA sequence. Some of the allele mutations that were found are shown above the DNA 
sequence and the particular nim1 allele that had that change is indicated. 

Sequence analysis of the region and sequencing of various niml alleles (see below) 
allowed identification of a cosmid region that contains the mrr.1 qene This region ,s 
delineated by a BamH 1 -EcoR V restriction fragment of -5.3 kb Cosmid DNA from D7 and 
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plasm.d DNA from pB.ueScr,pt.l(pBSII)were digested w,th Bam HI and with EcoRV (NEB). 
Tne 5.3 kb fragment from D7 was isolated from agarose gels and was purified us,ng the 
QIAquick gel extraction k,t (# 28796, Qiagen). The fragment was ..gated overnight to the 
Bam-EcoRV-d.gested pBSII and the ligation m.xture was transformed into E. co/, strain 
DH5a. Colon.es containing the insert were selected. DNA was isolated, and confirmation 
was made by digestion with Hind.ll. The Bam- EcoRV fragment was then engineered ,nto a 
binary vector (pSGCGOl) for transformation into Arabidops.s. 



3. Northern analysis of the four gene regions. 

Ident.eal Northern blots were made from RNA samples isolated from water, SA-, 
BTH- and INA-treated Ws and nimi lines as previously described (De.aney et al. 1995, 
PNAS 92, 6602-6606). These b.ots were hybridized with PGR products generated from the 
four gene regions identified in the 9.9 kb N,M1 gene reg.on. Only the gene region 
containing the N,Ml gene (Gene Region 2) had detectable hybr,d,zat,on with the RNA 
sampies. indicating that only the N,M1 region contains a detectab.e transcribed gene 
(Figure 16 and Table 18). 
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4. Isolation of a NIM1 cDNA. 

An Arabidopsis cDNA library made in the IYES expression vector (Elledge et al t 
1991, PNAS 88, 1731-1735) was plated and plaque (ifts were performed. Filters were 
hybridized with a 32 P-labeled PCR product generated from the gene region containing nim- 
1. 14 positives were identified from a screen of approximately 150,000 plaques. Each 
plaque was purified and piasmid DNA was recovered. cDNA inserts were digested out of 
the vector using EcoRI, agarose-gei-purified and sequenced. Sequence obtained from the 
longest cDNA is indicated in Figure 15. To confirm that we had obtained the 5c end of the 
cDNA, a Gibco BRL 5' RACE kit was used following manufacturer's instructions. The 
resulting RACE products were sequenced and found to include the additional bases 
indicated in Figure i5. The transcribed region present in both cDNA clones and detected in 
RACE is shown as capita! letters in Figure 15. Changes in the alleles are shown above the 
DNA strand. Capitals indicate the presence of the sequence in a cDNA clone or detected 
after RACE PCR. 

Example 9 
Characterization of the NIM1 gene 
The multiple sequence alignment was constructed using Clustal V (Higgins, 
Desmond Q. and Paul M. Sharp (1989), Fast and sensitive multiple sequence alignments 
on a microcomputer, CABIOS 5:151-153) as part of the DNA* (1228 South Park Street, 
Madison Wisconsin, 53715) Lasergene Biocomputing Software package for the Macintosh 
(1994). 

It has been determined that certain regions of the NIM1 protein are homologous in 
amino acid sequence to 4 different nee cDNA protein products. Tne homologies were 
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■denffied using the N,M1 sequences ,n a GenBank BLAST search. Compansons of the 
regions of homology ,n N,M1 and the nee cDNA products are shown ,n Figure 19. The 
NIM1 protein fragments show from 36 to 48% identical ammo acid sequences with the 4 
rice products. 

Example 10 

Phenotypic characterization of the various n,m1 alleles 
1 Analysis of chemical responsiveness in n,m1 alleles. 

We analyzed the differences among the various n,mi alleles in terms of chemical 
,nduct,on of PR gene expression and Peronospora parasitica resistance (see Figures 17 and 
18). 

Mutant plants were treated with chem.cal inducers and then assayed for PR gene 
expression and disease resistance. 

2. Plant growth and chemical application. 

Wild-type seeds and seeds for eacn of the n,mi alleles (mml-1, -2, -3, -4, .5, . 6) 
were sown onto MetroM.x 300 growing med.a. covered with a transparent plastic dome and 
Placed at 4° C ,n the dark for 3 days. After 3 days of 4° C treatment the plants were moved 
to a phytotron for 2 weeks. At approximate* 2 weeks post-planting, germinated seedlings 
had produced 4 true leaves. Plants were then treated with H ?0 , 5mM SA, 300 uM BTH or 
300 uM INA. Chemicals were app „ e d as a fine mist .0 completely cover the seedlings us.ng 
a cnrom,ster. Water control plants were returned to the growing phytotron wh„e the 
chemically treated plants were he.d ,n a separate but identical phytotron. After 3 days 
Plants were divided into 2 groups. One group was harvested for RNA extraction and 
analysis. The second group was inoculated with P parasitica 
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3. Peronospora parasitica inoculation and analysis. 

P. parasitica isolate 'EmWa' is a P.p. isolate that is compatible in the Ws ecotype. 
Compatible isolates are those that are capable of causing disease on a particular host. The 
P. parasitica isolate 'NoCo' is incompatible on Ws but compatible on the Columbia ecotype. 
Incompatible pathogens are recognized by the potential host, eliciting a host response that 
prevents disease development. At 3 days post-chemical application water and chemically 
treated plants were inoculated with the compatible 'EmWa' isolate. 'NoCo' inoculation was 
conducted on water treated plants only. Following inoculation plants were covered with a 
clear plastic dome to maintain high humidity required for successful P. parasitica infection 
and placed in a growing chamber with 19° C day/1 7° C night temperatures and 8h light/1 6h 
dark cycles. 

At various timepoints after inoculation plants were analyzed microscopically to 
assess symptom development. Under magnification sporulation of the fungus can be 
observed at very early stages of disease development. The percentage of plants/pot 
showing sporulation at 5d ( 6d, 7d, 1 1 d and 14d after inoculation was determined and the 
density of sporulation was also recorded. 

Figure 18 shows the disease assessment of the various niml alleles following P. 
parasitica inoculation. The most distinguishing timepoints are 5 and 6 days post- 
inoculation. At 5 days post-inoculation nim1-4 shows -80% infection under all inducing 
chemical treatments performed, clearly indicating that this allele/genotype has the most 
severe disease susceptibility. At 6 days post-inoculation, niml, -2, -3, -4 and -6 show 
significant disease incidence under all inducing chemical treatments. However, nim1-5 
shows less infection than Ws wild-type under all treatments at day 6 Therefore, nim1-5\s 
the most disease resistant of the various niml alleles. mm1-2 appears intermediate with 
respect to disease susceptibility after BTH but not the other inducing treatments 
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PR-1 gene expression indicates that n,m1-4 , s the least response to all of the 
inducing chemicals tested (Figure 17), while mm 1-5 shows elevated levels of PR-1 
express.cn in the absence of inducers. These PR-1 gene expression results are cons.stent 
with the disease assessment performed with P. P aras,t,ca (F.gure 18) and indicate that n.ml 
alleles can cause resistance or susceptibility. 

The samples obtained above were used to analyze N,M1 gene expression (Figure 
17). in w.ldtype plants N,M1 mRNA was present in the untreated control samples. 
Following treatment with SA, .NA, BTH or infection w.th a compatible pathogen the N,Hi 
mRNA accumulated to higher ,evels. Differences ,n N,M1 message (mRNA) abundance 
were observed in the n.ml alleles compared to w.ldtype. The abundance of N,M1 mRNA ,n 
untreated mutant plants was lower than observed ,n the w.ldtype with the exception of nimU 
2 and -5 where the amounts were S ,m„ ar . The niml-i. -3 and -4 had low levels of NIM1 
message while the n/mf-e had very low accumulation of N,M1 mRNA. Increases ,n N,M1 
mRNA fo.iowing SA, .NA or BTH were observed ,n n/m/-,. - 3 bu , not nim1 . 5 or 
However, this increase was less than observed in w.ldtype plants. Following pathogen 
infection additional bands hybridizing to the N.Ml cDNA probe were observed in both 
w.ldtype and mutants and the NIM1 mRNA level was elevated relative to untreated controls, 
except in mm 1-6. 

Figure 18 shows the disease resistance assessment va infection rating of the 
vanous mm , a.le.es as we,, as the NahG plants at various times after .nnoculaticn w„h 
Peronospora parasnica. WsWT indicates the Ws w„d type parent „ne ,n which the n, m1 
alleles are found. The vanous niml alleles are .nd.cated in the table and the NahG plant ,s 
:nd,cated also. Tne NahG plant has been previously published (Delaney et al. Science 
266. pp. 1247-1250 (1994)). The NahG Arabidopsis* also described ,n WO 95/19443 
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The NahG gene is a gene from Pseudomonas puttda that converts salicylic acid to 
catechol, thereby eliminating the accumulation of salicylic acid, a necessary signal 
transduction component for SAR in plants. Thus, NahG Arabidopsis plants do not display 
norma! SAR. In addition, they show much greater susceptibility in general to pathogens. 
Therefore, the NahG plants serve as a kind of universal susceptibility control. In addition, 
the NahG plants still respond to the chemical inducers INA and BTH; this is shown in the 
bottom two panels of the Figure 17. 

From Figure 18 it can be seen that the niml-4 and mm1-6 alleles seem to be the 
most severe; this is most easily observable at the earlier time points, described earlier in the 
results section herein, and from the results set forth in the EmWa BTH panel, the lowest 
panel, in the Figure. In addition, the nim1-5 allele shows the greatest response to both INA 
and BTH and therefore it is the weakest nim1 allele. 

The NahG plants show very good response to both INA and BTH and took very 
similar to the mml-5 allele. However, at late time points, Day 1 1 in the Figure, the disease 
resistance induced in the NahG plants begins to fade, and there is a profound difference 
between INA and BTH in that the INA-induced resistance fades much faster and more 
severely than the resistance induced in the NahG plants by BTH. Also seen in these 
experiments is that INA and BTH induced very good resistance in Ws to EmWa, and the 
niml-1, niml-2 and other rtiml alleles show virtually no response to SA or INA in regard to 
disease resistance. 

Figure 18 lists the percent of plants that are showing sporulation after infection with 
the EmWa race of P. parasitica, and each of the bar graphs indicates the number of days 
after infection that the disease resistance was rated. 

Northern analysis analyzing the expression of the SAR gene PR1 was also 
performed on the same samples, as shown in Figure 17 Figure 17 shows that the wild type 
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plant shows very good response to sal.cylate. INA. BTH and also to pathogen infection, as 
manifested by enhanced PR 1 gene expression. The n,m1-1 allele, on the other hand, 
shows only very limited response to all the chemical inducers including pathogen. 

The pathogen induction is at least several fold lower in the nim1-1 allele than it IS in 
the wild type. The niml-2. n,m/-3and n,ml-6 alle.es show response similar to the niml-1 
allele to the various treatments. However, the niml-4 allele shows virtually no express-on ,n 
response to any of the inducers used. Basically, background level is all that is observed. 
The n,mis allele shows a very h,gh background level relative to controls w.th water and 
that background level is maintained ,n all the treatments: however, there ,s hmited or no 
induction by the chemical inducers. 

The NahG plants serve as a good control, showing that they are unable to induce 
PR-1 m the presence of SA; on the other hand, INA and BTH both induce very strong high 
level expression of PR-1. The effect of pathogen infection ,s sim.lar to that of SA; there is 
no expression of PR-1 in the EmWa-treated NahG plants. 

These same RNA samples produced in the induction studies were also probed with 
a NIM1 gene using a full-length cDNA clone as probed. In Figure 16 it can be seen that INA 
mduces the NIM1 gene in the w.ld type Ws allele. However, the niml-1 mutation allele 
shows a lower basal level expression of the NiMl gene, and it is not inducible by INA This 
is similar to what is observed ,n the n,m1-3 allele and the n,m 1-6 allele. The n,ml -2 allele 
shows approximately normal levels in the untreated sample and shows similar rnducfon to 
that of the wild type sample, as does the n,mi-4 allele. The n,m/-5 allele seems to show 
higher basal level expression of the NIM1 gene and mucn stronger expression when 
induced by chemical inducers. 
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The induction of NIM1 by chemical inducers of resistance and other inducers is 
consistent with its roie in pathogen defense and is also further evidence that we have 
obtained the right gene in the 9.9 kb region. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATICS: 

(l) APPLICANT : 

(A) NAME: Novartis AG 

fBJ STREET: Schwarzwaldal lee 215 

(C) CITY: Basel 

(E) COUmr^Y: Switzerland 

(F) POSTAL CODE (ZIP) : 4002 

(G) TELEPHONE : +41 61 69 11 11 
;h; TELEFAX: + 41 61 696 79 76 
(I) TELEX: 962 991 



(ii) TITLE OF INVENTION : GENE CONFERRING DISEASE RESISTANCE IN 

PLANTS AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 11 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Ve 

(2) INFORMATION FOR SEQ ID NO : 1 : 

fi) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 9919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
Uii) HYPOTHETICAL : NO 
(IV) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TGATCATGAA TTGCGTGTAG GGTTGTGTTT TAAAGATAGG GATGAGCTGA AGAAGGCGGT 60 

GGACTGGTGT TCCATTAGAG GGCAGCAAAA GTGTGTAGTA CAAGAGATTG AGAAGGACGA 12 0 

GTATACGTTT AAATGCATCA GATGGAAATG CAATTGGTCG CGTCGGGC AG ATTGAATAGA 180 

AGAACATGGA CTTGTTAAGA TAACTAAGTG TAGTTGGTCC AC ATACTTGT TGTTCTATTA 2 40 

ACCCGGAAAA CTTCAACTTG TAATTTGCAG CAGAAGAGAT TGAGTGTCTG ATCAGGGTAC 300 

AACCCACTCT AACAGCAGAG TTGAAAAGTT TGGTGACATG CTTAAAACTT CAAAGCTGCG 360 

GGCAGCAGAA CAGGAAGTAA TCAAAGATCA GAGTTTCAGA GTATTGCCTA AACTAATTGG 42 0 

CTGCATTTCA CTCATCTAAT GGGCTACTTG TGGACTGCAA TATGAGCTTT TCCCTAATCC 4 80 

TGAATTTGCA TCCTTCGGTG GCGCGTTTTG GGCCTTTCCA CAGTCCATTG AAGGGTTTCA S4 0 

ACACTGTAGA CCTCTGATCA TAGTGGATTC AAAAGACTTG AACGGCAAGT ACCCTATGAA 6 00 

ATTGATGATT TCCTCAGGAC TCGACGCTGA TGATTGCTTT TTCCCGCTTG CCTTTCCGCT 6 60 

TACCAAAGAA GTGTCCACTG ATAGTTGGCG TTGGTTTCTC ACTAATATCA GAGAGAAGGT 72C 

AACACAAAGG AAAGACGTTT GCCTCGTCTC CAGTCCTCAC CCGGACATAG TTGCTGTTAT 7 80 

TAACGAACCC GGATCACTGT GGCAAGAACC TTGGGTCT AT CACAGGTTCT GTCTGGATTG 84 0 

TTTTTGCTTA CAATTCCATG ATATTTTTGG AGACTAC AAC CTGGTGAGCC TTGTGAAGC A 90 0 

GGCTGGATCC ACAAGTCAGA AGGAAGAATT TGATTCCTAC ATAAAGGACA TCAAAAAGAA 9 60 

GGACTCAGAA GCTCGGAAAT GGTTAGCCCA ATTCCCTCAA AATCAGTGGG CTCTGGCTCA 102 0 

TGACCAGTGG TCGGAGATAT GGAGTCATGA CGATAGAAAC AGAAGATTTG AGGG ~ AATTT i0 80 

GTGAAAGCTT TCAGTCTCTT GGTCTATCAG TGACAGCG AA CGCACCTGCA CATGTGGGAA 1140 
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GTTTCAATCG AAGAAGTTTC CA7GTATGCA CCCAGAAATG GTGCAAAGGA TTGTTAACTT 1200 
GTGTCATTCA CAAATGTTGG ATGCAATGGA GCTGACTAGG AGAATGCACC TTACACGCCC 1260 
ACTCAGTGTT CTCTTATCTC TAGACCTGAA ACTAACTTGC TGTGTAA7TC GAGTTACAAA 13 20 

AGGTTAAAGG AAGAATTAGG AAGATACATA TAACATGAAT GTTGCCAGAA GTTCAGGGAA 1380 
CTTGAATATT CTTTTGGTTC TTGGTGGAAA ATATCCAACA GATGAACAAT TTGACATTAT 144 0 

TTCACACTTT GATTCTAGCA ACTCTGTAAC ACCATCATGG GTTATTGTTG ATGTACATAA 150 0 

AT AT AT ATT A CAAATCTGTA TACCATTGGT TCAAATTGTT ACAACATTTG TTTGAAGCAC 1560 
ACCTGCAGCA ATAATACACA GGATGCAAAA CGAAGAGCGA AACTATATGA CGCCAACGA7 1620 
AGACATAAAC AGTTACAGTC ATCATGAAAA C AGAATTATA TGGTACAGCA AAAATTACAC 1630 
TAAGAGGCAA GAGTCTCACC GACGACGATG AGAGAGTTTA CGGTTAGACC TCTTTCCACC 1740 
GGTTGATTTC GATGTGGAAG AAGTCGAATC TGTCAGGGAC GAATTTCCTA ATTCCAAATT 1800 
GTCCTCACTA AAGGCCTTCT TTAGTGTCTC TTGTATTTCC ATGTACCTTT GCTTCTTTTG 
TAGTCGTTTC TCAGCAGTGT CGTCTTCTCC GCAAGCCAGT TGAGTCAAGT CCTCACAGTT 
CATAATCTGG TCGAGCACTG CCGAACAGCG CGGGAAGAAT CGTTTCCCGA GTTCCACTGA 
TGATAAAAAA AACAAGGTCA GACAGCAAGT AACAAAACCA TGTTTAAAGA TC ATTT AGTT 
TTGTTTTTTG TGATAAGGAG TCCGATGAAG TGGGTGAGAA TCCATACCGG TTTTAGAAAG 
CGCTTTTAGT GTACTTTGAT GCTCTTCTAG GATTCTGAAA GGTGCTATCT 7TACACCCCG 
TGATGTTCTC TTCGTACCAG TGAGACGG : C . ^,v, , -^UA- 

TGTTCCCTTC ATTTCGGCGA TCTCCATTGC AGCTTGTGCT TCCGTTGGAA AAAGACGTTG 
AG C AAGTG C A ACTAAAC AGT GGACGACACA AAG AA i A'.) «-^„ .A...A.T-1 
CCTAATAGAG AGGACATAAA TTTAATTCAA ACATATAAG A AATAAGACTT GAT AG AT AC C 24C 
TCTATTTTCA AG ATCGAGC A GCGTCATCTT C AATTC ATCG GCCGCCACTG C AAAAG AGGG 24 6 



1660 
1920 
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28^ 
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AGGAACATCT CTAGGAATTT GTTCTCGTTT GTCTTCTTGC TCTAGTATTT CTACACATAG 2 520 

TCGGCCTTTG AGAGAATGCT TGCATTGCTC CGGG AT ATT A TTACATTCAA CCGCCATAGT 2 5 80 

GGCT7GTTTT GCGATGATGA GTGCGGTTCT ACCTTCCAAA GTTGCTTCTG ATGCACTTGC 2640 

ACCTTTTTCC AATAGAGATA GTATCAATTG TGGCTCCTTC CGGATCGCAG CAACATGAAG 2 70 0 

CACCGTATAT CCCCTCGGAT TCCTATGGTT GACATCGGCA AGATCAAGTT TTAAAAGATC 2 76 0 

TGT7GCGGTC TTCACATTGG AATATGCAAC AGCGAAATGA AGAGCACACG CATCATCTAG 2 82 0 

ATTGGTGTGA TCCTCTTTGA AAAGCAACTT GACTAACTCA ATATCATCCG AGTCAAGTGC 2 8 80 

CTTATGTACA TTCGAGACAT GTTTCTTTAG TTTAGGTACG TCCAAACCAA GCTCTTTACG 2 94 0 

TCTATC AATT ATCTCTTTAA CAAGCTCTTC CGGCAATGAC TTTTCAAGAC TAACCATATC 3 000 

TAG ATT AG AC TTGACAATAA TCTGTTTAGA TCTATCCAAT AGGTTGATAG AAGCTTTACC 3 060 

ACATATATTA GCAAGCTTGA GTATAACCAA TGTGTCCTCT ATAACAACTT TGTCTACAAC 312 0 

GTCCAATAAG TGCCTCTGAA ATACAAATAC AAGTACTCAA GTAAGAACAT ATTCATGAAT 3180 

GTGTAAGCAT AGCTTAATGC AGATGGTGTT TTACGTGATA G AG AG T AATT AATTCAGGGA 324C 

TCT7GAAGAT GAAAGCCAAA TAGAGAACCT CCAACATGAA ATCGACCGCC GGCCGGCAAG 3 3 00 

CCACGTGGC A GC AATTCTCG TCTGCGCATT GAGAAACTCC TTTAGGCGGC GGTCTCACTC 3 360 

TGCTGCTGTA A A CAT AAG C C AAAACAGTCA CAACCGA ATC CAAACCGACT T G G T AA TC C T 3420 

TGGCAA.TCTC CTTAAGCTCG AGCTTCACGG CGGCGGTGTT GTTGGAGTCT TTCTCCTTCT 3 480 

TAGCGGCGGC TAAAGCGCTC TTGAAGAAAG AGCTTCTCGC TGACAAAACG CACCGGTGGA 3 54 0 

AAGAAACTTc CCGGCCGTCG GAGAGAACAA GCTTAGCGTC GCTGTAGAAA TCATCCGGCG 3600 

AGTCAAAGAC GGATTCGAAG CTGTTGCJAGA GCAATTGC AC A - ■ J AC AT AC A TCA.CiGTCCGG 3 6 60 

TGAGTACTTG TTCGGCGGCC AGATAAACAA TAGAGGAGT GG7GT7A7CG .' T A G C G A C G A '72 0 
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AACTAGTGCT GCTGATTTCA TAAGAATCGG CGAATCCATC AATGGTGGTG TGCATCAACA 378 0 

GGTTCCGATG AATTGAAATT CACAAATTAA AGAGATCTCT GCTAATCAAC GAAGAGACCT 3 340 

TATCAACTGG ATTTGGTTAA AGATCGAAGA TAACCATTGA CGAGCAGAGC CAAGTCAAGT 3 9 00 

CAACGAGAGT GGTGGTGAGA TATGAAGAAG CATCCTCGTC CCACGGTTTA CATTTCACCA 3960 

AAACCGGTAA ATTTCCAGGA AAGGAATCTT TGTCAGAGAT CTTTTTTAAA AAGATATAAC 4 02C 

AGGAAGCTAA ACCGGTTCGG GTTATAAATG TTAGTATTTA TACCGGAGAC ATTTTGTGTT 4 0 80 

GCTAATTTTT GTATATGAGA AGTTCAATCC GGTTCGGTAA GCCCCTGAAC CAAACTAGAT 414 0 

TTGGAGATGA TATAAAT ATA TAAAATTTAT TTTTCATCCG GTTCGTTATT TTCATATAAA 4 2 00 

TATATAAATA TTATTTTTTA AATTTAAGAA TTAGATTTAC ATGTGAAAGT TACATTTCTG 4 2 6C 

TTTATTTTCT TTGAAGTAAA ATGATAAAGG GAACGTATAT TAAGTTTCAT GCTTTATTCA 4 3 2C 

CATAAGTTTT GTAATGTATA TTATATTTTT CGTTTATTGA AAAAGTAATT TTCAGTGTTC 4 3 80 

AGCATGTTTA CACTATAATT AAATCAAGTC GAATATTTCC TGGAACTATT CTCCTTGTTC 4 4 40 

TATAGCAAAT GAAAACGCTC TTCACAACAA AATCATTATA GATATAGGAA TAAATTACAT 4 5 00 

TAAAAACATG AAAGTCATAA TGAATATATT TTTTTAATTA GGATTTGATT TAAAAACAAT 4 560 

TATTGTATAC ATATAAAAGA CTTCTTTAGT TATTTGCCTT CAACTTCTCG TTCTGAATCA 4 62 0 

TGCGATAAAT CAGCTTTTTC AATAACTACG ACGTAAAAGC AAATTC ATAA CACGTCTAAA 4 680 

CAAATTTGGC TCATCCTTCA CTTGATTGGT GTTTTCCGGA CTCGATGTTG CTGGAAACTG 474 0 

AGAAGAAGAA GGAATCTGCA TAATCACCTC TTGGTTCCTC ACCGGTAGAC TCATTTTGTT 4 800 

GGATCGAAAA CGATCGAGAT CAGAAAATGA AAAGATAGGT T AAAGATGCC TATGAATACA 4 8 60 

AC AACGTAAG ATTATGTTGA ATAAACAGAG TACTTTATA"!' AGGAGT7ATA ATAAGGTAAA 4 920 

TAAATTATTG CTTTCCGCGT TTTTTACTTT TGTATTTCTT AAATGATAAG TTAAATTAGG 4 9 80 

ATAAGATTTG TATGATTTTA AGTAAATTTA CAATAACTCT CTATAACTCA AT AGCATC AC 50 4 0 
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ATATTTAATT AA7TTTACTA ATTATCTTTT GAACAATTTT ATGAAATAGT TTTCTTTTAA 5100 
7TAA7TT77T AAAATGATAT ATTA7AAAA7 TTAATTGAAT CAATCTGATA TAATTTTTTT 5160 
ATCTTCTACC ATCTATTATA GTTGATAAAT ATTGTGATAA ACTTTAGATA AACACCCAAT 5220 
TGCCAAATAT TTAATAAATT TTGTGTACCA TGCGTTTTTT TTGGAGAATA TATATACGTG 52 80 

G AC AG CAT AC CGTACATATA TTGTATAAAA GCTTATAAAA CATAGATACG GGTTATATTG 
GTAAGCTATA AATATATGTA AACAATAGTA AGATATTACG TG7TGTGTCT AAATA7CTG7 
TGCTTTAGAT ATTATGTATA TCTAATATAT TAAAATATCT 7T7AT7AACT AA7ATATTA7 54 6 0 

TTAAGAGAGA AAATTGGGAC ACTATTTTCT ATACAGTAAC TGTTTTCAAC 7A7AAACAGG 5 52 0 

AACCCTTGAT A7AATAAAAT AACTAGCCAA AAAATCAGAT TAAATATTCA TAAAACAA7G 5 5 80 

777GG7A7TA TTACATAAAC CTAAGAAACA AAATTCAATA TTCCTTTTTA CCTTATAAAA 5 640 

AACAATTAAA CATCACTAGA TATATTTATG CCCCACAATG AGCGAGCCAA TTGAGACTTG 5700 
AGACTTGAGA TCCTTGTCAA CTACGTTTGC ATTTGTCGGC CCATTTTTTT TATTTTTTTT 
TTAAAGTGTC GGCCCGTTGC TTCTTCCGTT CAGATCAACC C7CTCGTAA7 CAGAA'JAAAA 
CGGAAAACAA ACGAAAGAAC AATCAGA7CC CTCTT7TTTT GCATAAACTA AATTCAACTT 
CTCTGCGTTT ATGTTGTAGA GGCAACCACG A7CACTACTA CGAAACAATA CAACGTCGTT 
GCTTCGAGTC CACGTAA7CA AATCTACTCC AATGCTTTTA ATATCTTTCA C7TTAACCCA 
CGACTTTTCA AAACTGCTCT TTAAAACCCA TAACTCGTGA ACATCTTCTT GATCTTTGTT 
TGTCCACTGA CGAATAGCAC CTAGCTTCCC TTCGTATCTG ACTAATCCTG AGAAAACATC 
AGAGTTCGGA GTA7GGAAGA AGGACCAAGT TTCGGTTTTG ACACAAAACC GGATCACATT 
GTTGTTC-CT GAT A T C r AA 7 GCAACAAGGG CGAAAC77G7 ATOGwTiGG .AAA A AA 7 7 A A 6240 
7C7G7C7G77 77TGGTAGAC GCAAATTTT~ 7AATCTCT7C C A G G 7 AAA C G A A. 7 C A G AA T r " 



5340 
5400 



5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 



6 3 o n 



burstitiitf ^rmn.P^ 
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GAAAACT.CG CACATAAAAG 77CTGTGATT CAAATGGTAG ATACCCCGAG ACATACACAT 636C 

ACGCCGAGAC TGCGAAAGCC TTTGTATTTT ATACCGGAAA GGGTTCAATC CCATTACCGC 6420 

TAAACCCAAT GACATATCCC AACCCT.CAC TTCTGGCTTT GGTATGACCT GATACTGTTT 6480 

AGTGGTTGGT TTGAAGACTA TGTATCCACG TGATGGTTTT GTATACTTAA CACAAAGCAA 65 40 

TATCCCATGA CTTGCATCAC AAGCTTCGAT CTTTATCATT CCGGGTGGCA GAAAGTCGAT 660= 

" — CTCATGGACA AAACTGGTTC GAAGTTCGTG 5660 

TCCT7TTAC7 ATG7AGTGTT GTATGAAGTA TCCCGAAATA CGATTGGTTC TAAGGAGATT 5720 
AAGA7TGACA AACCATGACT CG7AGCT7CT 7TTGTTGCAC TCTTTATTCA GGAGCCTGAA 
TTTTCCGATT 777GACGCCG GAAGATAAGA AAGAAATTCT TGGATCATGT CTTGATTTAT 

CACCGGAGAA CT^ATGATP^ — Tr ^ rr ,,, m 

C - -TC^AAT AAACAGATGA GCACGATCAC TGAATGAGAA 

ATGAAAAAAT CAGGATCGGT AOAGAACAAC TTATGATGAA TAAAGTGTTT ATATATGCTT 
TCTTTGTTTA AGGAAAGTAT CAAAATTTGC CTmTC^C GCTAGTCCTA AAACAAACAA 
ATTAACCAAA AGATAAAATC TTTCATGATT AATGTTACTT GTGATACCTT AAGCCAAAAG 70,0 
TTTATCTTTA GAC^AAC CAAATCTACA GTAATTTAAT TGCTAGACTT AGGAAACAAC 
TTTTTTTTTT AGCCAACAAT CTTTGGAT7T T AA TTGTTTT TTTTTCTACT AATAGATTAA , J0C 

CAACTCATTA TATAATAATG 77TC7ATCAT h-t^^.t ~ 

ATCAT AA.TGACAAT TCTTTCTTTT TAATAAACAT 7 2 60 

CCAGCTTGTA TAATAATCCA CAAGTCAATT T^AC^A-T— 

-- - r ~- n - • • A TTTTCTTATA 

AAAAT7AGCA CAAAAAAGAT 1'ATCATTGTT TAGCAGATTT AATTTCTAAT TAACTTACGT 
AATTTCCATT TTCCATAGAT TTATCTTTCT TTTTATTTCC 7TAGTTATCT TAGTACTTTC 74<! 
TTAGTTTCCT T«T«TTTT AAATTTTAAG ATAA.ATA,, GAAATTAAAA GAAGAAAAAA , M( 

AACTCTAGTT ATACTTTTGT TAAATG^TTC A T-rAr-- ~ 

i.^i^.I^ AT^ ACT AA CTAATAATTT TTTTTAGTTA -55.- 

AATTACAATA TATAAACACT GAAGAAAGTT TTTGG~C r AC ACTT^Tr- r*^« 

-v-^ca^ AL uiTi TGG GATCAATTAG ^ ? p 



6 7 6 0 
6840 
6900 
6960 
""02 0 



■ j z 0 



^380 
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TACTATAGTT AGGGGAAGAT TCTGATTTAA AGGATACCAA AAA TG AC TAG TTAGGACATG 76 80 

AATGAAAACT TATAATCTCA ATAACATACA TACGTG7TAC TGAACAATAG TAACATCTTA 7 74 0 

CGTGTTTTGT CCATATATTT GTTGCTTATA AATATATTCA TATAACAATG TTTGCATTAA 7 8 00 

GCT77TAAGA AGCACAAAAC CATATAACAA AATTAAATAT TCCTATCCCT ACCAAAAAAA 78 60 

AAAATTAAAT ATTCCTACAG CCTTGTTGAT TATTTTATGC CCTACGTTGA GCCTTGTTGA 7 92 0 

CTAG7TTGCA TTTGTCGGTC CATTTCTTCT TCCGTCCAGA TCAACCCTCT CGTAATCAGA 79 R0 

ACAAAAGGGG AAACAAACGT AAGAGGCAAA ATCCTTGTTT GTATGAACTA AGTTTAACTT 804 C 

CTCTGTGTT7 AAGTTGTAGA GGCAAACATG ATCCCAACTA GAAAGCATTA CGACGTCGTT 8100 

GCTTGGTA7C CACGTAATAT GCTCTACTCC AATGCTTTCA ATATCTTTCA CTTTTTCCCA 3160 

CG ACTTTT 2 A AAAGTGCTCT TTAAAACCCA TAATCTGTGA AC ATCTTCTT GATTGTTGTT 92 2 0 

TATCCAGTGA CGAATAACAC CTAGCTTCCC TTCGTAGCTG ACTAACTCTG GGAATAAACC 62 8 0 

AACGTTTGGA GTATGTAAGA AAGACCAAGT TTCGGTTTTG GGACATAACC GGATCACATT 83 4 0 

GTGGTTCCAT GATCTCCAAT GCAAGAACCC TGAAGCTTGT ACCGGGTTTG AAAGAATTAG 84 0 0 

r\'^'^'^3 i i j. i ~ i '-uu * ^ix_iii>_ -j^ . . « • * . . » - ^ • v- . * w . -„ - - - - » - — ~ ^ - - 

AAAAACTTCG CACGCAAAAG TTCTGAGATT CCGAGTCATA CC AGGCGATT TCGAAAGCCT 8 52 0 

AAATATTTTA TACCGGAAAG GCTGCAATCC GGTTACCGTT AGACCTAATG ACTTATCACA 8 58 0 

ACTCCTCACT TTTGGGTTTG GTATGATCTG ATACTGTTTT GTTGTTGGTT TGCAGACTAT 8 64 0 

GTATTCCGGT ATTGGTCTTG TATCATTATA ACAAAGCAAT ATCCCATGAC GTGCATCACA 87 0 C 

AGCTTTGATC TTTACCTCTC CTTGTGGCAG AAA AT C G A T 3 GAGA TTCCTT T3T7ATCCAA 87 6 0 

ATCTCTCCTC TCATGGAAAA AACTGGTATC AAGTTTGTAT "CTCTTTCGT AGCGTTCTAG 8 8 20 

GAAGTATCCA GAGATATTGT TGGTTCCATG GAGATTTAGG TTGACAAACC AAGACTCGTA 8880 
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GCTTCTCTTG TTGCACTCTT TATTGATGAG CCT^AAT^T^ — 

v-^^AAT.i. .wATTTCGG ACCCCCGAAG 8940 

ATAAGAA.GA ACCTGTTGGA .CCTGTCCTC ATCTATCACC GGAGAACTCA TGATCTTATT „„0 

CGAAAAAAGA AAGAAAGAGA TGAGCACGAT CAGTGAATGA GATATATAGA AATCAGGATT MS0 

GGTAGAGAAC CGACGATGAT GAATATACAA GTCTTTA.AA GTATCACAAA TTGCCTTTT7 ,, 20 

GTTGGCTAGT CCCAAAACAA GCAAATTAAC C AAA GAT AAA ATCTTCATTA ATGTTTTCCT 318C 

. TTCTTCGC CAGTCCCAGA TAAAAATATA ti ~s » « . -r . T ^„ . 

- A . At-lAmT A T TTCATTAGGT TACTTGTAGT 924 0 

ACCTTGACCC CAAAGTTTCT CT~TTGACT^ tt a « ,-v- » . 

C .ACT . TTAACCAAAT TAACAGTAAT TTAATAGCTA 9 3 0 C 

vACTTAGAAA ACAACATTTT HTA"" 1 A ts^it 

jACAT C,^AAATTCAA CAATCTTTGG 93 60 

GTTTCTATAG ""GTTTTTTTT GTTATTCTAA TAGATTACCA GTCATTATAT CATATACAAA ,„„ 

™ t ™ m AGCAAGTTTG TT GT T ATATC m , 0 

ATCATTCAGC AGATTTCTTA AT~AAA— - rrr-»^ 

A '- rtA — i » GTGATTTCCA TTTTGCACCT ATATGTTTCT 

CTTTCTTAGT- TTAGTACTTT AAATTTTCAT ATATATAATT 

lA1AfllT TATTAAAATT AAAAGTAAAA 

«TCC«TTT AACTTATGTT AAATGTTTCA TC AC ACT AAA AGAGCATTAA GTAATAAATA 
TTTTAGCTTT ATGAAAAAAA ATATCAAATC ACTGAAGACA TTTGTTGGCC TATACTCTAT 
TTTTTATTTG GCCAATTAGT AATAGACTAA TAGTAACTCA TATGATATGT CTCTAATTCT 
GGCGAAACGA ATATTCTGAT TCTAAAGATA GTAAAAATGA 

- rcATuA AtjGGAATACT 9840 

ATTTCACACA C C T A G AAA G A GTAACGTAGA AACCTTTT 



9 54C 
9 60C 
9660 
972C 
9780 



:ttttggt:a gattcttgta 9900 

TCAAGAAGTT CTCATCGAT 



99 19 



(2) INFORMATION FOR SEQ ID NO : 2 : 

( i ) SEQUENCE CHARACTERISTICS : 

<Ai LENGTH: 55 5b base pairs 
|B! TYPE: nucleic acid 
(C) STRANDEDNESS : single 
:0) TOPOLOGY ■ linear 
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iii) MOLECULE TYPE: DNA (genomic} 
(in) HYPOTHETICAL: NO 
i:v) ANT I - SENSE : NO 



(IX) FEATURE: 

; A ) NAM E / KEY. e xon 

(B) LOCATION: 2787.. 3347 

(D) OTHER INFORMATION: /producC= " 1st. excn of NIMI " 

(IX) FEATURE: 

(A) NAME / KEY : exon 

(B) LOCATION: 3427.. 4162 

(D) OTHER INFORMATION : /product= "2nd exon of NIMI" 

(IX) FEATURE. 

(A) NAME / KEY : exon 

(B) LOCATION : 4271.. 4474 

<D> OTHER INFORMATION: /product= "3rd exon of NIMI " 

(ix) FEATURE. 

(A) NAME /KEY : exon 

(B) LOCATION: 4 58 6.. 4 8 6 6 

,D) OTHER INFORMATION; /product "4th exon cf NIMI" 

(lx) FEATURE: 

(A) NAME /KEY: CDS 

( 3 ) LOC AT I ON ; join {2787. .3347, 3427. .4x62, 4 2 1 . 4474, 
4S86 . . 4366 ) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TGTGATGCAA GTCATGGGAT ATTGCTTTGT GTTAAGTATA CAAAACCATC ACGTGGATAC 6 0 

ATAGTCTTCA AACCAACCAC TAAACAGTAT CAGGTCATAC CAAAGCCAGA AGTGAAGGGT 120 

TGGGATATGT CATTGGGTTT AGCGGTAATC GGATTGAACC CTTTCCGGTA ~ ' AAAA T A C A A 180 
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300 



420 
480 

540 
600 
660 
72 0 
780 
S40 



ACTTTTATGT GCG AAGTTTT CGAT rpr *TGA rn ^^t^sp^t „ x _ 

' ~ ^ A . . L - j x T * ACCT GGAAGAG ATT AGAAAATTTG 

CGTCTACCAA AAACAGACAG ATTAAT^TT m , > ^^^^ , ^ « ^ . 

AAltiT * ^CA^CCGA TACAAGTTTC GGGGTTCTTG 3 60 

CATTGGATAT CACGGAACAA CAATGTGATC CGGTT^TGT- — * 

ojA * * ^CAaaACCGA AACTTGGTCC 

TTCTTCCATA CTCCGAACTC TGATGTTTTC TCAGGATTAG TCAGATACGA AGGGAAGCTA 

GGTGCTATTC GTCAGTGGAC AAACAAAGAT CAAGAAGATG TTCACGAGTT ATGGGTTTTA 

AAGAGCAGTT TTGAAAAGTC GTGGGTTAAA GTGAAAGATA TTAAAAGCAT TGGAGTAGAT 

TTGATTACGT GGACTCCAAG CAACGACGTT GTATTGTTTC GTAGTAGTGA TCGTGGTTGC 

CTC7ACAACA TAAACGCAGA GAAGTTGAAT TTAGTTTATG CAAAAAAAGA GGGATCTGAT 

TGTTCTTTCG TTTGTTTTCC GTTTTGTTCT GATTACGAGA GGGTTGATCT GAACGGAAGA 

AGCAACGGGC CGACACTTTA AAAAAAAAAT AAAAAAAATG GGCCGACAAA TGCAAACGTA 

GTTGACAAGG ATCTCAAGTC TCAAGTCTCA ATTGGCTCGC TCATTGTGGG GCATAAATAT 

ATCTAGTGAT GTTTAATTGT TTTTTATAAG GTAAAAAGGA ATATTGAATT TTGTTTCTTA 

GGTTTATGTA ATAATACCAA ACATTGTTTT ATGAATATTT AATCTGATTT TTTGGCTAGT 10 20 

TATTTTATTA TATCAAGGGT TGCTGTTTAT AGTTGAAAAC AGTTACTGTA TAGAAAATAG 108 0 

TGTCCCAATT TTCTCTCTTA AATAATATAT T^GTTAATAA AA^T tW * 

bliM1AA «AoATATTTT AATATATTAG H40 

ATATACATAA TATCTAAAGC AACACA7ATT TAG AC ACAAC ACGTAATATC TTACTATTGT :200 

TTACATATAT TTATAGCTTA CCAATATAAC CCGTATCTAT GTTTTATAAG CTTT TATAC A 1 260 

ATATATGTAC GGTATGCTGT CCACGTATAT ATATTCTCCA AAAAAAAGGC ATGG TACACA 13 2 0 

AAATTTATTA AATATTTGGC AATTGGGTGT TTATPTAAA" n-»Tr^* f t . m 

IUAA ' ,J i T i A i CAC AA TATTTATCAA 13 80 

CTATAATAGA TGGTAGAAGA TAAAAAAATT ATAT r "AGA~T - , . ^ . . ,^ 

./M^M^rt.i . Tv. A/\TTrt AATT7TATAA 1440 

TATATGATTT TAAAAAATTA ATTAAAAGAA AACTATTT r "A T^yr-™ r- - . . . 
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TTAGTAAAAT TAATTAAATA TGTGATGCTA TTGAGTTATA GAGAGTTATT GTAAATTTAC 15 60 

TTAAAATCAT ACAAATCTTA TCCTAATTTA ACTTATCATT TAAGAAATAC AAAAGTAAAA I62C 

AA C G C GG AAA GCAATAATTT ATTTACCTTA TTATAACTCC TATATAAAGT ACTCTGTTTA 1680 

TTCAACATAA TCTTACGTTG TTGTATTCAT AGGCATCTTT AACCTATCTT TTCATTTTCT 17 4 0 

GATCTCGATC GTTTTCGATC CAACAAAATG AGTCTACCGG TGAGGAACCA AGAGGTGATT 180C 

ATGCAGATTC CTTCTTCTTC TCAGTTTCCA GCAACATCGA GTCCGG AAAA CACCAATCAA 1860 

G7GAAGGATG AGCCAAATTT GTTTAGACGT GTTATGAATT TGCTTTTACG TCGTAGTTAT 192 0 

TGAAAAAGCT GATTTATCGC ATGATTCAGA ACGAGAAGTT GAAGGCAAAT AACTAAAGAA 198 0 

GTCTTTTATA TGTATACAAT AATTGTTTTT AAATC AAATC CTAATTAAAA AAATATATTC 2 04 0 

ATT AT G AC TT TCATGTTTTT AATGTAATTT ATTCCTATAT CTATAATGAT TTTGTTGTGA 2100 

AGAGCGTTTT CATTTGCTAT AGAACAAGGA GAATAGTTCC AGGAAATATT CGACTTGATT 2160 

TAATTATAGT GTAAACATGC TGAACACTGA AAATTACTTT TTCAATAAAC GAAAAATATA 222 0 

AT AT AC ATT A CAAAACTTAT GTGAATAAAG CATGAAACTT AATATACGTT CCCTTTATCA 2 2 80 

T7TTAC7TCA AAGAAAATAA ACAGAAATGT AACTTTCACA TG T AAA T C T A ATTCTTAAAT 2 34 0 

TTAAAAAATA ATATTTATAT ATTTATATGA AAATAACGAA CCGGATGAAA AATAAATTTT 2 4 0C 

ATATATTTAT ATCATCTCC A AATCTAGTTT GGTTCAGGGG CTTACCG AAC CGGATTGAAG 2 4 6 0 

T T C T C AT A T A CAAAAATTAG C AAC AC AAAA TGTCTCOGGT A T AAA T A C T A ACATTTATAA 2 52 0 

CCCGAACCGG TTTAGCTTCC TGTTATATCT TTTTAAAAAA GATCTCTGAC AAAGATTCCT 2 580 

TTCCTGGAAA TTTACCGGTT TTGGTGAAAT GTAAACCGTG GGACGAGGAT GCTTCTTCAT 2 64 0 

ATCTCACCAC CACTCTCGTT GACTTGACTT GGCTCTGCTC GTCAATGGTT ATCTTCGATC 2700 

T T T AA C C AAA TCCAGTTGAT AAGGTCTCTT CCTTGATTAG CAGAG ATCTC TTTAATTTGT 27 60 

GAATTTCAAT TTATCGGAAC C 7GTTG ATG GAG A "~ A"" ATT ~AT GTA TTG CO " 2813 
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Met Asp 

1 c; 



ir Thr Asp Gly Phe Ala 



TAT GA A ATC AGC AGO ACT ACT T~C G ~C r C - 
Asp ser Tyr Glu He Ser Se^ * ^ C ACC 2861 

er Tiir Ser Pne V ^ Ala Thr Asp Asn Thr 

~ 5 20 25 

GAC TCC TCT ATT CTT TAT CTC GCC CCC GAA CAA GTA CTC ACC GGA OCT 
" P 116 Val L - Ala Ala Glu Gin va 1 Leu Thr Gly Prl 

J ^ 40 



100 



105 



AAC ACC GCC CCC CTC AAC CTC GAG CTT AAG GAG ATT GCC AAG GA~ TAC 
Asc Thr «. „. ¥al Lys Leu Glu Leu , ys n> « «• -c 

110 * y ~ 



115 



120 



GAA GTC GGT TT^ GAT Trr — r r— ~ * ™ 

„ v 0 G ^ GTT TTG OCT TAT GTT TAC AGC 

G '- Vd l Gly Phe Asp S«- "= 1 "= i • . 

1? , A " r " e ~ Ald T Vr Va. Tyr Ser 

130 n5 

sZ 2 Z I" T G ^ ^ GAC GAG 

: ^ Pr ° Gi V Val Ser Riu Cys Ala As P Glu 

140 145 150 

Asn Cys Cys His Val Aia Cys A-a Pro A'. ■ - • - - 

, 55 . 9 ° A ' a Va ' p ^ Xet Leu Glu 



2957 



GAT GTA TCT GCT CTG CAA TTG CTC TCC AAC AGC TTC GAA , C C G ~ TTT 
Ma L ^ ° la ^ ^ Ser Asn Ser P he Glu Ser Val p" 
45 50 5 5 

GAC TCG CCG GAT GAT TTC TAC AGC GAC GCT AAG CTT GTT -TC T r^ PAP 
Asp Ser Pro Asp Asp P he Tyr Ser Asp A1- ^ _ ™ ^ £ ™ 

65 70 

GGC CGG CAA GTT TCT TTC CAC CGG TGC GTT TTG TCA GCG AG A AGC TCT 3053 
G-y Ars c.u Val ser Pne „ ia Arg Cys Val Leu fier ^ ^ ^ ™ 

' 5 80 85 

TTC TTC AAG AGC GCT TTA GCC GCC GCT AAG AAG GAG AAA GAC TC~ AAC 
- P he Lys ser Ala Leu Ma a ^ ^ l ^ ^ p ^ AAC 



3101 



3149 



3 1^7 



3245 



3293 



160 
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GTT CTC TAT TTG GCT TTC ATC TTC AAG ATC CCT GAA TTA ATT ACT CTC 
Val Leu Tyr Leu Ala Phe He Phe Lys He Pro Glu Leu He Thr Leu 

17C 180 ii 8 5 



3 341 



TAT GAG GTAAAACACC ATCTGCATTA AGCTATGGTT ACACATTCAT GAATATGTTC 3 3 97 

Tyr Gin 



TTACTTGAGT ACTTGTATTT GTATTTCAG AGG CAC TTA TTG GAC GTT GTA GAC 

Arg His Leu Leu Asp Val Val Asp 
190 195 

AAA GTT GTT ATA GAG GAC AC A TTG GTT ATA CTC AAG CTT GCT AAT ATA 
Lys Val Val He Glu Asp Thr Leu Val He Leu Lys Leu Ala Asn He 
200 205 210 

TGT GGT AAA GCT TGT ATG AAG CTA TTG GAT AGA TGT AAA GAG ATT ATT 
Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys Glu He He 
215 220 225 

GTC AAG TCT AAT GTA GAT ATG GTT AGT CTT GAA AAG TCA TTG CCG GAA 
Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser Leu Pro Glu 
230 235 2 4o 

GAG CTT GTT AAA GAG ATA ATT GAT AGA CGT AAA GAG CTT GGT TTG GAG 3 642 

Glu Leu Val Lys Glu He He Asp Arg Arg Lvs Glu Leu Gly Leu G 1 u 
245 2 5C 2S5 



GTA CCT AAA GTA AAG AAA CAT GTC TCG AAT GTA CAT AAG GCA CTT GAC 
Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp 

260 265 ->nn 



75 



AAT CTA GAT GAT GCG TGT GCT CTT CAT TTC GCT GTT GCA TAT TGC A^T 

Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Va I Ala Tyr Cys Asn 

295 300 305 

GTC, AAG ACC GCA AC A GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC AAC 

Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Al 3 As P v^l Asr 

3 1- ' r - j - 



3450 



349S 



354« 



3 594 



3 6 9 0 



TCG r,A T GAT ATT GAG TTA GTC AAG TTG CTT TTG AAA GAG GAT CAC ACC J/ 3 8 

Ser Asp Asp He Glu Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr 
280 285 290 



3 7 86 



18 3 4 
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CAT AGG AAT CCG AGG GGA TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG 
His Arg Asr. Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg 

325 330 335 

AAG GAG CCA CAA TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA 
Lys Giu Pro Gin Leu He Leu Ser Leu Leu Giu Lys Gly Ala Ser Ala 
340 345 350 355 

TCA GAA GCA ACT TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA 
Ser Giu Ala Thr Leu Giu Gly Arg Thr Ala Leu Met He Ala Lys Gin 
360 365 370 

GCC ACT ATG GCG GTT GAA TGT AAT AAT ATC CCG GAG CAA TGC AAG CAT 
Ala Thr y.ez Ala Val Giu Cys Asn Asn He Pro Giu Gin Cys Lys His 
375 380 3 85 

TCT CTC AAA GGC CGA CTA TGT GTA GAA ATA CTA GAG CAA GAA GAC AAA 
Ser Leu Lys Gly Arg Leu Cys Val Giu He Leu Giu Gin Giu Asp Lys 
390 395 400 

CGA GAA CAA ATT CCT AGA GAT GTT CCT CCC TCT TTT GCA GTG GCG GCC 
Arg Giu Gin He Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala 
405 410 415 

GAT GAA TTG AAG ATG ACG CTG CTC GAT CTT GAA AAT AGA G 
Asp Giu Leu Lys Met Thr Leu Leu Asp Leu Giu Asn Arg 
420 425 430 

GTATCTATCA AGTCTTATTT CTTATATGTT TGAATTAAAT TTATGTCCTC TCTATTAGGA 

AACTCACTCA ACTAATGATA ACTATTCTTT GTGTCCTCCA CTGTTTAG TT GCA CTT 

'-■ a 1 A ^ a Leu 
4 35 

GCT CAA CGT CTT TTT CCA ACG GAA GCA CAA CCT GCA ATG GAG ATC GCC 
Ala Gin Arg Leu Phe Pro Thr Giu Ala Gin Ala Ala Met Giu lie Ala 



440 44 ; 



450 



3882 



3 93 0 



3 978 



402 € 



4074 



4122 



.62 



422 2 



4 32 6 



GAA ATG AAG GGA AC A TGT GAG TTC ATA GTG "T VG~ — ^AG r— ^ar , i - „ 

Giu Met Lys Gly Thr Cys Giu Phe He Val Thr Scr L^u Giu Pro Asp 
455 460 465 
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CGT CTC ACT GGT ACG AAG AGA AC A TCA CCG GGT GTA AAG ATA GCA CCT 4 4 22 

Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys lie Ala Pro 
470 475 480 

TTC AGA ATC CTA GAA GAG CAT CAA ACT AGA CTA AAA GCG CTT TCT AAA 4 470 

?he Arg He Leu Glu Glu His Gin Ser Arg Leu Lys Ala Leu Ser Lys 
435 490 495 

ACC G GTATGGATTC TCACCCACTT CATCGGACTC CTTATCACAA AAAACAAAAC 4 52 4 

Thr 
500 

TAAATGATCT TTAAACATGG TTTTGTTACT TGCTGTCTGA CCTTGTTTTT TTTATCATCA 4584 

G TG GAA CTC GGG AAA CGA TTC TTC CCG CGC TGT TCG GCA GTG CTC 4 62 9 

Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Ala Val Leu 
505 510 515 

GAC GAG ATT ATG AAC TGT GAG GAC TTG ACT CAA CTG GCT TGC GGA GAA 4 677 

Asp Gin He Met Asn Cys Glu Asp Leu Thr Gin Leu Ala Cys Gly Glu 
520 525 530 

GAC GAC ACT GCT GAG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA 47 2 5 

Asp Asp Thr Ala Glu Lys Arg Leu Gin Lys Lys Gin Arg Tyr Met: Glu 
535 540 545 

_ ... a '"A ^~A AAG AAG r.rr ^T AGT GAG GAC AAT TTG GAA TTA 477 3 

— v . r , t.,- ^t-j pu 0 c,-»»- ni n icp acri Leu C \ u I.eu 
lie ijiri sj^u an" i-e'- j -> — - * ~ — . --i- 

550 555 560 



GGA AAT TCG TCC CTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC 
Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys S«»r Thr 

575 

GGT GGA AAG AGG TCT AAC CGT AAA CTC TCT CAT CGT CGT CGG TGA 
Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg Arg 

qpn 585 590 



482 1 



4866 



492 6 



GACTCTTGCC TCTTAGTGTA ATTTTTGCTG TACCATATAA TTCTGTTTTC ATGATGACTG 
T A A C T G T T T A TGTCTATCGT TGGCGTC ATA TAGTTTCGCT CTTCCTTTTG CATCCTGTGT 
ATTATTGCTG CAGGTGTGCT T G AAA C AAA T GTTGTAAC AA TTTGAACCAA TGGTATACAG 504 6 



■1 9 R 6 
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ATTTGTAATA TATATTTATG TACATCAACA ATAACCCATG ATGGTGTTAC AGAGTTGCTA 5106 

GAATCAAAGT GTGAAATAAT GTCAAATTGT 7CATCTGTTG GATATTTTCC ACCAAGAACC 5166 

AAAAGAATAT TCAAGTTCCC TGAACTTCTG GCAACATTCA TGTTATATGT ATCTTCCTAA 522 5 

TTCTTCCTTT AACCTTTTGT AACTCGAATT ACACAGCAAG TT AG TT TC AG GTCTAGAGAT 5286 

AAGAGAACAC TGAG TGGGCG TGTAAGGTGC ATTCTCCTAG TCAGCTCCAT TGCATCCAAC 5 34 6 

ATTTGTGAAT GACACAAGTT AACAATCCTT TGCACCATTT CTGGGTGCAT ACATGGAAAC 5 4 06 

TTCTTCGATT CAAACTTCC- ACATGTGCAG GTGCGTTCGC TGTCACTGAT AGACCAAGAG 5466 

ACTGAAAGCT TTCACAAATT GCCCTCAAAT CTTCTGTTTC TATCGTCATG ACTCCATATC 5 52 6 

TCCGACCACT GGTC ATGAGC CAGAGCCCAC TGATTTTGAG GGAATTGGGC TAACCATTTC 5 58 6 

CGAGCTTCTG AGTCCTTCTT TTTGATGTCC TTTATGTAGG AATCAAATTC TTCCTTCTGA 5 64 6 
CTTGTGGAT 



i2) INFORMATION FOR SEQ ID NO : 3 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(ll) MOLECULE TYPE: protein 

( x • i 3H5'JE:;CE Z- ESCRi PTIO.V : SEQ ID NO:3: 

Met Asp Thr Thr He Asp Gly Phe Ala Asp Ser Tyr Glu lie Ser Ser 

5 10 15 

Tnr Ser Phe Va 1 Ala Thr Asp Asn Thr Asp Ser Ser lie Val Tyr Leu 

23 25 30 

Ala Ala Glu Gin Val Leu Thr Gly Pro Asp Val Ser Ala Leu Gir. Leu 
3 5 4 0 4 5 



5655 
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Leu Ser Asn Ser Phe Glu Ser Val Phe Asp Ser Pro Asp Asp Phe Tyr 



5C 



55 



60 



Ser Asp Ala Lys Leu Val Leu Ser Asp Gly Arg Glu Vai Ser Phe His 

7 o 75 80 

Arg Cys Val Leu Ser Aia Arg Ser Ser Phe Phe Lys Ser Ala Leu Ala 

90 95 



85 



Ala Ala Lys Lys Glu Lys Asp Ser Asn Asn 

inn 105 



Thr Ala Ala Val Lys Leu 



lu Leu Lys Glu lie Ala Lys Asp Tyr Glu Val Gly Phe Asp Ser Val 



G 

115 



120 ^5 



val Th^ Val Leu Ala Tyr Val Tyr Ser Ser Arg Val Arg Pro Pro Pro 
130 135 140 

Lys Gly Val Ser Glu Cys Ala Asp Glu Asn Cys Cys His Val Ala Cys 
145 150 1" 160 

Arg Pro Ala Val Asp Phe Met Leu Glu Val Leu Tyr Leu Ala Phe lie 
165 170 I 75 

. T, e r.iu Leu lie Thr Leu Tyr Gin Arg His Leu Leu Asp 

flit ■=» - - e 4.*- - i -t ^ ' 

185 190 



Val Val Asp Lys Val Val 

195 



He Glu Asp Thr Leu Val He Leu Lys Leu 



200 205 



Ala Asn lit ^y 

215 



Gly Lys Ala Cys Mer. Lys Leu Leu Asp Ary Cys Lys 



z x - o 



He Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser 



Glu He 
225 230 



135 240 



Leu Pro Glu Glu Leu 



Val Lys Glu lie He Asp Arq Arg Lys Glu Leu 



245 250 255 



Giy Leu Glu Val Pro Lys Val Lys Lys His Va : Jer Asn Vai H : s :.y 



2 60 



265 



? 7 0 
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rt .a Leu Asp Ser Asp Asp lie Glu Leu Va 1 Lys Leu Leu Leu Lys Glu 
2 ~ S 290 285 

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala 

290 295 300 

Tyr Cys Asn Val Lys Tnr Aia Thr Asp Leu Leu Lys Leu Asp Leu Ala 
305 310 315 320 

Asp Val Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala 
325 330 335 

Ala Met Arg Lys Glu Pro Gin Leu Tie Leu Ser Leu Leu Glu Lys Gly 
340 345 350 

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Aia Leu Met He 
355 360 365 

Ala Lys Gin Ala Thr Met: Ala Val Glu Cys Asn Asn He Pro Glu Gin 
370 375 380 



Cys Lys His Ser Leu Lys Gly Arg Leu Cys Val Glu He Leu Glu Gin 

40C 



385 390 3 95 



Glu Asp Lys Arg Glu Gin He Pro Arg Asp Val Pro Pro Ser Phe Ala 
4 °5 410 415 

Val Ala Ala Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 
420 425 43Q 

Val Ala Leu Ala Gin Arg Leu Phe Pro Thr Glu Ala Glr. Ala Aia Met 
435 440 445 

,1a Glu Met Lys Gly Thr Cys Glu Phe lie Val Thr Ser Leu 
450 455 460 



Glu Pro Asp Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys 

480 



465 470 47 



lie Aia Pro Phe Arg lie Leu Glu Glu His Gin Ser Ara Leu Lys Ala 
485 490 495 



Leu Ser Lys Thr Val Glu Leu Gly Lys Arg Pne Phe Pr 



o Arg Cys ser 
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500 

Ala Val Leu Asp 
515 

Cys Gly Glu Asp 
530 

Tyr Met Glu lie 
545 

Leu Glu Leu Gly 



Lys Ser Thr Gly 
580 



Gin He Met Asn 
520 

Asp Thr Ala Glu 
535 

Gin Glu Thr Leu 

550 

Asn Ser Ser Leu 

565 

Gly Lys Arg Ser 



505 

Cys Glu Asp Leu 



Lys Arg Leu Gin 
540 

Lys Lys Ala Phe 

555 

Thr Asp Ser Thr 

570 

Asn Arg Lys Leu 
585 



510 

Thr Gin Leu Ala 
525 

Lys Lys Gin Arg 



Ser Glu Asp Asn 

560 

Ser Ser Thr Ser 
5 "7 5 

Ser His Arg Arg 

590 



Arg 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

He Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp He Glu Leu Val 

! 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 

i ~ 2 5 ^ ^ 

Val His Tyr Ala Val Gin His Cys Asn 
35 40 
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) INFORMATION* FOR SEQ 10 NO : 5 . 

'l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 ammo acids 

(B) TYPE: ammo acid 

CO STRANDEDNESS: not relevan: 
(D) TOPOLOGY: not relevanc 

in) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 5 : 

Pro Thr Giy Lys Thr Ala Leu His Leu Ala Ala Glu Met Va 1 Ser Pro 



5 10 15 



Asp Met Val Ser Val Leu Leu Asp His His Ala Aso Xaa Asa Pre a- 

20 25 



Thr Xaa Asp Gly Val Thr 

35 

INFORMATION FOR SEQ ID NO : 6 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

-.ii) MOLECULE TYPE: peptide 



3 0 



:xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



He Arg Arg Met Arg Arg Ala Leu Asp Ala Al< 

15 



La Asp He Glu Leu Val 

1 ^ 10 
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Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Aia 

2 0 2 5 3 0 

Val His Tyr Ala Val Gin His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: ammo acid 

id STRANDEDNESS : not relevant 

I'D) TOPOLOGY: not relevant 

(ii> MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 

1 5 10 15 

Ser Pro Asp Met Val Ser Val Leu Leu Asp Gin 

20 25 

iZ) INFORMATION FOR SbQ iU NO; O ; 

(i) SEQUENCE CHARACTERISTICS: 

{ A J LENGTH: 4 1 amino acids 
(3) TYPE: amino acid 
( C } STRANDEDNESS: not icicvdiiL 
(D) TOPOLOGY : not relevant 

(li) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
lie Arq Arg Met Arg Arq Aia Leu Asp Ala A. a Asp lie G 1 1 j Leu Val 
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10 15 



.ys Leu Met Val Met Gly Glu Gl 

30 



y Leu Asp Leu Asp Asp Ala Leu Ale 

20 ~c 



Val His Tyr Ala Val Gin Hi 



s Cys Asn 



3 5 40 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 27 amino acids 
(B) TYPE: amino acid 
!.C- STRANDEDNESS : not relevant 
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 

1 5 10 



Ser Pro Asp Met Val Ser Val Leu Leu Asp Gin 
20 25 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 

'i! SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 1 amino acidb 
iB) TYPE: amino acid 
<C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : peptide 



(x: J SEQUENCE DESCRIPTION: SEQ ID NO : 1 ( 
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lie Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp He Glu Leu Val 
5 10 15 

Lys Leu Met Val Met Gly Giu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gin His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

( i } SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

!u) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 

Pro Thr Gly Lys Thr Ala Leu His Leu Ala Ala Glu Met Val Ser Pro 
15 10 15 

Asp Met Val 
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Applicant s or agent'* file 

reference number Pb , 2 12 12/ A/CGC 1909 



Internal uxul application f 



PCT/EP 9 7/012 18 



INDICATIONS RELATING TO A DEPOSITS!) MICROORGANISM 

(PCT Rule llbts) 



A. The indications made below relale io the microorganism referred to in :he tiesc;»p(ion 
on page _ , line 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional iheet | J 



Name of depositary msinunon 



rican Type Culture Collection (ATCC) 



Address of deposilar v msinut;on (including pasta* code and country) 

12301 Parklavn Drive 
Rockville, MD 20852 
USA 



Dale of deposit 



25 September 1996 (25.09.96) 



Accession Number 



97736 



C. ADDITIONAL INDI CATIONS (leave blank if not appitcabL) This informanon .s continued on an addiiion.l .beet □ 



We requeat the Expert Solution where aval 



lable 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Uf the indications are noi for all daifnmud States) 



E. SEPARATE FURNISHING OF INDICATIONS (io^a biank ,f not ap^ 



,cabU\ 



Tbe ind»ot»on» lmed below w.ll be subm.ttod ,o the Inlem.uorul Bureau ia.er (specify ine t e*eral nature of tk* indication*** 'Ac 
Number of Deposit ) *"* 



For receiving Office use only 



This fcbeel wee received with the interna taonal applicatioo 



Authorized officer 

Y. Mirinuavv.d. NUxA%*lw>d 



Form PCT/RCV134 (July 1992) 



For I n tern* (i cm* J Bureau use ooJy 



is tbeei wit received by the International Bureau on: 



Authorized office/ 
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American Type Culture Collection 



I I3«l * ft.wfc.Ui. MO H»5J ISa * Til >| imi; T*«« *MOU ATCCtOVt • F*X KHIMJ<» 



BUDAI'tS r TRKA'! Y ON THE INTERNATIONAL RECOGN fTION OF 
THE DP.POS1T Or MICROORGANISMS FOR THE PURPOSES OF PATENT PKOCFDURE 



INTERNA. TiONAL FORM 
RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7 3 
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10 2 



To: (Nam a and Address of Depositor or Attorney) 

Oba-Geigv Corporation 
Attention: Leslie 8 Frednch 
P.O Box 12257 

Research Triangle Park. NC 27709 
Oopo sited on Behalf of: Ciba-GeiQy Corporation 

Identification Reference by Depositor: ATCC Designation 

Casmid. D7 97736 

The oopasit was acconoamod hy a scientific description a proposed taxonomic dcscnpnon indicated 



The deposit was received SoptrmDrr 2^. I^Qn by this international Depositary Autnonty and has been accepted. 
AT YOUR REQUEST; 



The strain wiJI be mode available if a patent office signatory to the Budaoest 1 reatv certittec one's right to receive, 
or ii a U.S. Parent is issued cuing tne suam, ana ATCC is mstructoa by th< united States Parent &• Trademark 
OM ce or trie depositor to release said strain 

it the cutturo should 0<o or Do Oestroyoo dunng the elective term of the depniit, it shall be vOuf t espcnsibitity to 
replace it with, living culture or the samr 

T he suatn will be maintained tor a penod uf at least 30 vears after tn« date o< dooosit. and for a Derma ot ai least 
Itve years a*ter truj ntu^t recent reQue^i lor a sanipie. Tho United St.itos ano many other counters arr signatory 
to the BudaooM Trootv 

The viability of tne culture cued above was tested October 3. 1 006 On tiiat da r .e, the Culture was viable 
International Depository Authority: American Type Culture Collection. RoCKViiie, rvla 20852 uSa 
Signature of person having authority to represent ATCC: 



auove 



x 



We wiJt inform you at requests tor the strain for 30 years. 




Data: QciCwCf 1 9PC 



Andrea C Walsh Ph O 



CTITIITC CM 
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Applicant i or apenij Hie ! International appJtcal 

reference numoer 

PF, 21212/A/CCC 1909 



on r T/EP9 / / 0 1 2 J 



INDICATIONS R ELA11NC TO A UF.POSITIJI) MintOOUGANISM 

(PCT Rule 136i.v) 



A. The maicjuons made bc.ow relate to ine microorganism reterred to m the description 
on page line 



B. IDENTIFICATION OK DEPOSIT 



Funher deposits arc tUcnuhed on an additional sheet [ | 



Name o( ueposiijfv ins.iiuncn 



rican Typo Culture Collection <ATCC) 



Address oi deposnarv ir.sniuiion (including postoi code and country^ 

12301 Parklavn Drive 
Rockville, MD 20B52 
USA 



Date ol tJepus 



13 June 1996 (13.06.96) 



Accession Number 



97606 



C. ADDITIONAL INDICATIONS i team blank if not applicable) This information is con! mued on an additional ibeet Q 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MA D K (if the indications are not for all designated Stairs) 



E. SEPARATE FURNISHING OF INDICATIONS f/w« bUnk ,f applicable) 



The indication* listed below will be submit ted to the International Bureau later {specify the general nature of theirUtc*uo*3*\g^ 'Ac* 
Number of Deposit ') 



For receiving Office um only 



5^ T^** abeet was received with the international applicaii 

Y. Marinu*-v.d. Nouvi^iand 



Authorized officer 



For Interna ttooal Bureau ua« otuy 



I I ThU «i>e«t was received by the I menu t kraal Bureau 4 



Authorized officer 



Form PCT/RCV134 (July 1992) 
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American Type Culture Collection 

r*rfcU»« D«l«r ♦ R»cfcv11ft«. MO JMH USA • T«hphci*< (391)131 M1<J T#lti: lfl~S)s ATCCNOBTH • FAX: 



BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF 
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE 

f STERN A TIOSAL FORM 

RECEIPT IN THE CASE OF AN ORTGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3 
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10. 2 



To: (Name and Address of Depositor or Attorney) 

Ciba Geigy Corporation 
Attn; Leslie B. Fnednch 
P.O. Box 1 2257 

Research Triangle Park, NC 27709 
Deposited on Behalf of: Clba-Galfly Corporation 
Identification Reference by Depositor: 

Ptasmid P 1 1 6 



RECEIVED 

JUN 2 7 1996 

CIBA-GEIGY 
ABRU PATENT DEPT 



ATCC Designation 

97606 



a scientific description _ o proposed taxonomic description indicated 
The deposit was received .Innw 1 3 1 996 by this International Depositary Authority and has been accepted. 



The deposit was accompanied by 
above . 



AT YOUR REQUEST: 



We will inform you of requests for the stiain tor 30 years. 



The strain wiu be marje available if a patent office signatory to the Budapest Treaty certifies one's right to 
receive, or if a U S Patent is issued citing the strain, and ATCC is Instructed by the United States Patent & 
Trademark Office or the depositor to release said suem 

if the culture snouid die or be destroyed during the effective term oi the deposit, it snan be your responsrbiiuv 
to replace it wuh nvmg culturo ot the samo 

The strain will be rra-ni^ne6 for a penoo of at least 30 years from date of .Jeposit. or five years after the most 
recent reoueat fcr e sample, whichever is longer. The United States ana many other countries are signatory 
to the Budapast Treaty. 

The viability of the culture c;tea above was tested Iupa ?Q 1 996 On mar date the culture was viable. 
International Depository Authority: American Type Culture Collection, Rockville, Md. 20852 USA 



Signature of person having authority to represent ATCC: 
Barbara M. Hailay. Administrator, Patent Depository 



Date: j^nfl ?1 ■ 1396 



Andrea C. Walsh. Ph.D. 
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Applicant's or a genii file p 
reference numoc/ 



-21212/A/CGC 1909 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 



A. The indicauon_s rn.iuc bciuw re.aie io Lhe microorganism relerrcU to in ihe description 
on page , | t ne 



B. IDENTIFICATION OK DEPOSIT 



Furiher deposiis are identified on an additional sheet | ] 



Name o: depositary inMiiuiion 



Aaerican Type Culture Collection (ATCC) 



Address ol depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, MD 20852 
USA 



Dale ol (Jeposti 



08 May 1996 (08.05.96) 



Accession Number 



97543 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MA DE(i///ic indications are not for mil designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (lea* blank if mot applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of ihei/UjcMt*o*s eg., 'Aco 
Number of Deposit') 



For receiving Office use only 



[X\ This sheet was received with the international application 



Authorized officer 

Y. Marinutw.d. Koow^knd 



For International Bureau use only 



I 1 This sheet was received by the International Bureau < 



Authorized officer 



Form PCT/R 0/134 (July 1992) 
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American Type Culture Collection 

11*1 r«>^«i Drt»« • Ro*Iit*««. MD 1*IS2 USA • T«Up*M»«: <>«] )2JI-9tt» T«fc«: l*-t»5 4KCNOSTH • FAX: J0I-TTO-1M7 

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF <r i & A ' ' 

THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE 

INTERNA TIONA L FORM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3 
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10 2 



To: (Nam© and Addran of Depositor or Attorney} 

Cibe-Getgy Corporation 
Attn: Leslie B. Fnednch 
P.O. Box 1 2257 

Research Triangle Park, NC 27709 
Deposited on Behalf of: Ciba-Geigy Corporation 
Identification Reference bv Depositor: 
Piasmid, BAC4 



"OFJVED 

MAY 2 8 1996 

ClbA-GEJGY 
ABRU PATENT DEPT. 



ATCC Damnation 

971343 



The deposit was accompanied by: 
above . 



o scientific description _a prooosud taxonomic description Indicated 



The deposit was received Mav fl 1 996 by this International Depository Authority eno has been accepted. 

AT YOUR REQUEST: w « w<lt Inform you of roquosts for the strain for 30 years. 

The strain will bo made available if a patent ofttce signatory to the Budapest Treaty certifies one's right to 
receive, or if a U.S. Patent is issued citing the strain, end ATCC is instructed by the United States Patent & 
Trademark Office or the deposiror to release saio strain 

if the cukure should die or oe destroyed during the olfocttvo io»m of tho deposit, it shall b« your r esponstbihl y 
to rep'ace tt wun living culture of mo same 

The strain will be maintained for a period of at least 30 years from date of deposit, or Uvc years afterthe most 
recent request for a sample, whichever is longer. The United States and many other countries are signatory 
to the Budapest Treaty. 

The viability of the culture cted above was tested Mav 1 7 _ 1 996 On thaT date :ne culture was viable. 
International Depository Authority: American Type Culture Collection. Hockvitie. MO. ^0852 USA 
Sigpatura of parson having authority to represent ATCC: 

Data: May 20. 1996 



Sigoatura of parson having authority to 
Barbara M. Hatley, Administrator. Pa£n 



nt Depository 



Andrea C. Walsh. Ph.D. 
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What Is Claimed Is: 

1. An isolated DNA molecule comprising a NIM1 gene. 

2. An .solated DNA molecule according to claim 1, comprising the nucleotide sequence set 
forth in SEQ ID NO:2. 

3. An .solated DNA molecule of adout 9.9kb which encodes the NIM1 gene product. 

4. An isolated DNA molecule according to claim 1, comprising the nucleotide sequence set 
forth in SEQ ID NO:1. 

5 An isolated DNA molecule of claim 1 , encoding the amincaco seauence of the NIM1 
gene product set forth in SEQ ID NO:2. 

6. An isolated DNA molecule comprising a mutant gene of NIM1 of claim 1 . which ,s a niml 
gene. 

7. Clone BAC-04, ATCC Deposit No. 97543. 

8. A chimeric gene comprising a promoter active in plant operably linked to a heterologous 
DNA moiecule encoding the aminoacd sequence of a NIM1 gene product. 

9. A chimenc gene comprising a promotor active in plant operably linked to the 
heterologuous DNA fragment according to claim 3. 

10. A chimeric gene comprising a promotor active in plant operably linked to a 
heterologuous DNA molecule encoding the aminoacd sequence set forth ,n SEQ ID NO:2. 
1 1 A ch,mer.c gene comprising a promotor active in plant operably linked to a 
heterologuous DNA molecule encoding the aminoacid sequence of a n,m1 gene product. 

12. A recombinant vector comprising the chimeric gene of anyone of cla.ms 8 to 1 1 . 

13. A recombinant vector according to claim 12. wherein sa.d vector ,s capable of be.ng 
stably transformed into a host cell. 

14 A recombinant vector according to cla.m 1 2, wherein sa,d vector ,s capable of be.ng 
stably transformed into a plant, piant seeds, plant tissue or plant cell. 
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26. Use of an isolated DNA molecule according to claim 1 and variants thereof in a 
screening method for identifying compounds capable of inducing broad spectrum disease 
resistance in plants. 

27. Use of a plant phenotyp according to claim 17 to identify an isolated gene fragment 
which allows expression of broad spectrum of disease resistance in plants. 

28. Use of an isolated DNA molecule according to anyone of claims 1 to 5 to confer disease 
resistance to plant cells, plants and the progeny thereof. 

29. Use of an isolated DNA molecule according to claim 6 to confer universal disease 
susceptibility to plant cells, plants and the progeny thereof. 

30. Use of resistant plants and the progeny thereof according to claim 20 to incorporate the 
resistant trait into plant lines through breeding. 
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15. A piant expression cassette compr.s.ng a ch.menc gene of anyone of claims 8 to 1 1 . 

16. A plant express.on cassette compns.ng a ch.menc gene of claim 8 to 10. 

1 7. A piant express.on cassette comprising a chimeric gene of claim 1 1 . 

18. A plant expression cassette accord.ng to claim 15 to 17 which expresses the chimeric 

gene continously or constitutively. 

19. A plant, p.ant ceils and the progeny thereof comprising the chimeric gene of anyone of 



claims 8 to 11 , 



20. A plant, plant cells and the progeny thereof comnrtcinn tha „k 

H yyeny mereoT comprising the chimeric gene of anyone of 

cla,ms 8 to 10. wh,ch have a broad spectrum of disease resistance. 

21 . A plan,, plant cel.s and the progeny thereof comprising the chimeric gene of claim 1 1 . 

22. A plant, plan, cells and the progeny thereof of c.aim 19. where.n said plant is selected 
form the group consisting of gymnosperms, monocots. and dicots. 

23. A plant, plant ceils and the progeny thereof of Cairn 19. where.n sa.d plan, is a crop 



plant. 



24 A p,an,. plan, cells and ,he progeny ,hereo, of claim 23. wherem sa,d p,an, ,s seieced 
form ,he group cons,s„ng of nee. wheat, barley, rye. com. p o,a,o. carro,. swee, potato, 
sugar bee,, bean. pea. chicory, le.fuce. cabbage. cauliflower, brocco,,. tu rn,p. radish, 
spinach, asparagus, onion, game, eggpfant. pepper, celery, carrot, squash. pumpKin. 
zucchtn,, cucumber, appie. pear. c,u,nce. melon, plum, cherry, peach, necar.ne. apneo,. 
strawberry, grape, raspberry, blackberry, p.eappie. avocado, papaya, mango, banana, 
soybean, tobacco, tomato, sorghum and sugarcane 

25. Use of an isolated DNA molecule accord.ng ,c claim 1, gene ,o confer disease 
resistance m plants. 
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Schematic representation of the Nim flanking YACs with regard to the genetic map and the 
AFLP markers. Below the physical distance is indicated in n x 100 kb. Only the positions of 
markers W84.2 and WL84.4&5 were determined very accurately, the other markers can only be 
positioned within a certain interval. 
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Positions of P1 and BAC clones with respect to the flanking AFLP markers and YACs. The 
positions of clones P1-3 and P1-4 with respect to YACs 10G07 and 7E03 were not determined 
At the bottom the physical distance in kb's is indicated. The physical distance and extent of 
overlaps are best estimates and not exact values. 
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markers. Clones P1-3. P1-4 and P1-9 overlapped completely with YACs 10G07 and 7E03 and 
were not positioned with respect to these YACs. At the bottom the physical distance in kb's is 
indicated. The physical distance and extent of overlaps are best estimates and not exact values. 
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Schematic representation of all identified P1 and BAC clones and their relative positions in the 
Nim region. At the bottom the physical distance in kb's is indicated. The physical distance and 
extent of overlaps are best estimates and not exact values . 
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Integrated genetic and physical fine map of the NIM region. The scale on the bottom is in 
kb. 
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Integrated map of the NIM region. This map is a best fit of the AFLP fingerprinting and 
restriction mapping data. Shown are the relevant AFLP markers, the BAC and P1 clones, 
the cosmid contig, the sizes of the cosmid inserts (in parentheses) and some restriction 
sites. The scale on the bottom is in kb. 
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Integrated map of the NIM region including the new AFLP markers. This map is a best fit of 
the AFLP fingerprinting and restriction mapping data. Shown are the relevant AFLP 
markers, the BAG and P1 clones, the cosmid contig, the sizes of the cosmid inserts (in 
parentheses) and some restriction sites. The scale on the bottom is in kb. 
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Schematic representation of the genotypes of the most important recombinants D169 and 
C105. Shown is the recombinant chromosome with the Ler portion indicated as a black bar 
and the Ws portion as a gray bar (the non-recombinant chromosome is of the Ws-type). On 
top the positions and approximate distances of the relevant AFLP markers are shown 
related to the actual size in kb*s as shown at the bottom. 
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agtcaagtgc cttatgtaca ttcgagacat 
tccaaaccaa gccctccacg tctatcaatt 
cggcaatgac ttttcaagac taaccatatc 
tctctttaca tctatccaat agcttcatac 
gcaagcttga gtataaccaa tgtgtcctct 
gtccaataag tgcctctgaa atacaaatac 
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tccatcaaca ggctccgatg aattgaaatt 
gctaatcaac gaagagacct tatcaactgg 
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tgecaaatat ttaataaatt ttgtgtacca 
tatataegtg gacagcatac cgtacatata 
catagatacg ggttatattg gtaagctata 
agatattacg tgttgtgtet aaatatgtgt 
tctaatatat taaaatatct tttattaact 
aaattgggac actattttct atacagcaac 
aacccttgat a taa taaaat aactagecaa 
taaaacaatg tttggtatta ttacataaac 
ttccttttta ccttataaaa aacaattaaa 
ccccacaatg agegagecaa ttgagacttg 
ctacgtttgc atttgtegge ccattttttt 
ggcccgttgc ttcttccgtt cagatcaacc 
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aatt acaa ta 
ga:caattag 
aaatgac tag 
tacgtgt tac 



cggaaaacaa 
aantcaactt 
cgaaacaata 
aa:gc:ttta 
t 1 aaaaccca 
cgaa t agcac 
agagt tcgga 
ggatcacatt 
atcgggttgg 
taatctcttc 
t:ctgtgatt 
tgcgaaagcc 
taaacccaat 
gatactgttt 
gtatacttaa 
ctttatcatt 
aatcactcct 
atgtagtgtt 
aagat tgaca 
ggagcctgaa 
tggatcatgt 
aaagagatga 
agagaacaac 
aggaaagtat 
actaaccaaa 
aagccaaaac 
tgctagactc 
taactgtttt 
tttctatcat 
taataatcca 
aaaat tagca 
taacctacgt 
tcagttatct 
araacatat t 
taaatgtttc 
tataaacact 
taccacagtt 
t taggacatg 
tgaacaatag 



acgaaagaac 
c:ctgcgtt: 
caacgtcgt t 
atatctttca 
taactcgtga 
ctagcttccc 
gtatggaaga 
gctgttccgt 
aaaaaat :aa 
caggtaaacg 
caaatgg tag 
tttgtat ttt 
gacatatccc 
agtggttggt 
cacaaagcaa 
ccgggtggca 
c tcatggaca 
gtatgaagta 
aaccatgacc 
1 1 ttccgacc 
cctgat ttat 
gcacga tcac 
t tatgatgaa 
caaaatttgc 
agataaaatc 
tttatcttta 
aggaaacaac 



aat caga t cc 
atgt tgtaga 
get tggagtc 
c t ttaaccca 
acatc t tc 1 1 
t tcgtatc tg 
aggaccaagc 
gatatccaat 
tctgtctgtt 
aa tcagaat c 
ataccccgag 
ataccggaaa 
aaccct tcac 
ttgaagacta 
tatcccatga 
gaaagtcgat 
aaactggttc 
t cccgaaa ta 
cgtagct tct 
t ttgacgccg 
caccggagaa 
tgaa tgagaa 
taaagtgt t t 



aat tgacaat 
caagtcaat t 
caaaaaaga t 
aatttccatt 
tagtac tt tc 
gaaat taaaa 
atcacac taa 
gaagaaagt t 

aggggaagat 

aatgaaaact 
taacatc t ta 



t ttcatga 1 1 
gact tt taac 
t ttt ttt ttt 
aa taga ttaa 
tctt tctttt 
tcacca 1 1 1 1 
tatcattgtt 
t tccataga t 
ttagtt tcct 
gaagaaaaaa 
ctaataat t t 
tt tggcccac 
tctgat t taa 
tataatc t ca 
cgtgtt ttgt 
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7751 ccatatatt: gtcgcttaca 

7801 gcttttaaga agcacaaaac 

7851 accaaaaaaa aaaattaaat 

7901 cctacgtcga gccttgttga 

7951 cccgcccaga tcaaccctct 

8001 aagaggcaaa atcctcgttt 

8051 aagttgtaga ggcaaacatg 

8101 gcttggtatc cacgtaatat 

8151 ctttttccca cgacttttca 

8201 acatcctctt gattgttgtt 

8251 ttcgtagccg actaactctg 

8301 aagaccaagt rrcggttttg 

8351 gacccccaat gcaagaaccc 

8401 accgtctgt: ctcggtagac 



8451 


a a t cggaatc 


^ ^ ^ ^ s t~ t~ r*t 
aaaadL l_ u y 


8501 


ccaggcgat t 


t egaaagee t 


8551 


ggttaccgtt 


agacctaatg 


8601 


gtatgatctg 


atactgtt tt 


8651 


at tggtcttg 


tatcat tata 


8701 


agctttgatc 


tt tacctctc 


8751 


tgttatccaa 


ate tctcctc 


8801 


cctctttcgc 


agegt tc tag 


8851 


gagatttagg 


t tgacaaacc 


8901 


tattgatgag 




8951 


acctc:rgga 


tcgtgtcctg 


9001 


y y ddadady i 


a a g a a a g a g o> 


9051 


aatcaggatt 


ggtagagaac 


9101 


gtatcacaaa 


t tgee 1 1 1 1 1 


9151 


caaagat aaa 


at^^t^atca 


o -"i r\ i 


taaaaat ata 


t a t aaaa ta t 


92 51 




cttttgactt 


9301 


gacttagaaa 


acaacat tt t 


9351 


caatc t ttgg 


gtttctatag 


9401 


ctcattatat 


catatacaaa 


9451 


tt taaaaat t 


agcaagt ttg 


9501 


attaaac eta 


gtgat t tcca 


95 51 


t tag tact r t 


aaat t ttcat 


9601 


ac tccagt 1 1 


aacttatgt t 


9651 


gtaat aaata 


1 1 tt agct 1 1 



aatatattca cataacaatg tttgcattaa 
catataacaa aattaaatat tcctatccct 
attcctacag ccttgttgat tattttatgc 
etagtttgea ctcgtcggtc catttcttct 
cgtaatcaga acaaaagggg aaacaaacgt 
gtatgaacta agtt taact t c tctgtgt tt 
atcccaacta gaaagcatta egaegtegtt 
gctctactcc aatgetttea atatctttca 
aaactgetet ttaaaaccca taatctgtga 
tatccagtga cgaataacac ctagcctccc 
ggaataaacc aacgtttgga gtatgtaaga 
ggacataacc ggatcacatt gtggttccat 
tgaagcttgt accgggtttg aaagaattag 
gcaaattttt taatctcttc cacataaacg 
caegcaaaag ttctgagatt ccgagtcata 
aaatatttta taceggaaag gctgcaatcc 
acttatcaca actcctcact tttgggtttg 
gttgttggtt tgcagactat gtattccggt 
acaaagcaat atcccatgac gtgeatcaca 
cctgtggcag aaaatcgatg gagactcctt 
tcatggaaaa aactggtatc aagtttgtat 
gaagtatcca gagatattgt tggttcgatg 
aagactcgta gcttctcttg ttgeactett 
ccgatttegg acccccgaag ataagaaaga 
atttatcacc ggagaactca tgatcttatt 
tgagcacgat cagtgaatga gatatataga 
cgacgatgat gaatatacaa gtgtttataa 
ettegctagt cccaaaacaa gcaaattaac 
atgttttccc r t ttcttege cagtcccaga 
rrrattaggt tacttgtagt accttgagcc 
ttaaccaaat taacagtaat ttaatacjeta 
gtatatatat tctttgacat caaaattcaa 
tgtttttttt cttattctaa tagattacca 
gtgtttcctt ttcaatcaac atccattttc 
ttcttatatc atcattcagc agatttctta 
ttttgeaect atatgtttct ctttcttagt 
atatataatt tattaaaatt aaaagtaaaa 
aaatgtttca tcacactaaa agagcattaa 
atgaaaaaaa atatcaaatc actgaagaca 
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97C1 tttgttggcc tatactctac 

9751 tagtaaccca tatgatatct 

9801 tctaaagata gtaaaaacga 

9851 cctagaaaga gtaaggtaga 

9901 tcaagaagct ctcatcgat 



ttcttatttg gccaattagt aatagac taa 
crctaattcc ggcgaaacga atatcctgat 
at t t tgatga agggaatact at t tcacaca 
aacctttttt tttttggtca gattcttgta 
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